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S 46 sheet(s) of drawings. 
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filing purposes.) 
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7. P Preliminarily, please amend the specification by inserting before the first line the following: 

-This application is a ^ continuation of Serial No. 08/771,777, filed 12/20/96, which is a continuation in part of 08/706,945, 
filed 9/3/96, which is a continuation in part of 08/577,788, filed 12/22/95 which is hereby incorporated by reference.- 

8. □ Transfer the drawings from the prior application to this application and abandon said prior application as of the filing 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant{s): Boyle et al. 
Serial No.: Continuation of 09/1 32,985 
Filed: September 24, 1999 
For: Osteoprotegerin 
Docket No.: A-378-CIP2C2 

PRELIMINARY AMENDMENT 

This preliminary amendment accompanies the filing of a continuation application under 37 

CFR 1.53(b). 

AMENDMENT 

Please amend the application as follows: 
In the specification: 

At p. 1 , line 1 , Insert the following: 

- - This application is a continuation of Serial No. 09/132,985 filed August 12, 1998, which 
is a continuation of Serial No. 08/771 ,777, filed December 20, 1 996, which is a continuation In part of 
08/706,945, filed September 3, 1996, which is a continuation in part of 08/577,788, filed December 22, 
1995 which Is hereby incorporated by reference - -. 

In the claims: 

Please cancel Claims 1-60 without prejudice and add the following new claims. 

- 61 . (new) A method for treating bone loss In a mammal comprising administering to the 
mammal an expression vector comprising a nucleic acid sequence encoding osteoprotegerin. 

62. (new) The method of Claim 61 wherein the nucleic acid sequence Is selected from the 
group consisting of: 

a) a nucleic acid encoding a polypeptide comprising the amino acid sequence from 
residues 1 to 401 or from residues 22 to 401 as shown in Figure 9C-9D (SEQ ID NO:124); 
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b) a nucleic acid enccxiing the polypeptide as in (a) wherein the polypeptide comprises 
from 1 to 21 6 amino acid residues deleted from the carboxy terminus; and 

c) a nucleic acid which hybridizes under high stringency conditions with the nucleic acid set 
forth in (a) and (b) wherein the hybridizing nucleic acid encodes a polypeptide having the activity of 
inhibiting bone resorption. 

63. (new) The method of Claim 62 wherein the nucleic acid encodes a polypeptide 
comprising residues 22-185, 22-189, 22-194, or 22-201 inclusive as shown in Figure 9C-9D (SEQ ID 
NO:124). 

64. (new) The method of Claim 62 wherein the nucleic acid further comprises an Fc region 

of human IgG. 

65. (new) The method of Claim 61 wherein the expression vector is a viral vector. 

66. (new) The method of Claim 61 wherein the expression vector further comprises a 
pharmaceutically acceptable adjuvant. 

67. (new) The method of Claim 61 wherein the bone loss is due to osteoporosis. 

68. (new) The method of Claim 61 wherein the bone loss is due to rheumatoid arthritis. 

REMARKS 

Applicants have canceled Claims 1-60 which are pending in the parent application, U.S. 
Serial No. 09/132,985 without prejudice and reserve the right to introduce claims relating to the subject 
matter therein in timely filed continuation applications. 

Claims 61-68 have been added which are directed to treatment of bone loss by 
administering an expression vector comprising a nucleic acid encoding osteoprotegerin. It is believed that 
the claims do not introduce new matter or new issues requiring search. Entry of the new claims is 
respectfuily requested. 
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CONCLUSION 



It is submitted tliat Claims 61-68 are in condition for allowance and an early notice thereof 
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Robert B. Winter 
Attorney/Agent for Applicant(s) 
Registration No.: 34,458 
Phone: (805)447-2425 
Date: September 24, 1999 
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OSTEOPROTEGERIN 



Field of the Invention 

VThe invention relates generally to 
5 polypeptides involved in the regulation of bone 

metabolism. More particularly, the invention relates 
to a novel polypeptide, termed osteoprotegerin, which 
is a member of the tumor necrosis factor receptor 
super family. The polypeptide is used to treat bone 
10 diseases characterized by increased bone loss such as 
osteoporosis , 

Background of the Invention 

Polypeptide growth factors and cytokines are 

15 secreted factors which signal a wide variety of changes 
in cell growth, differentiation, and metabolism, by 
specifically binding to discrete, surface bound 
receptors. As a class of proteins, receptors vary in 
their structure and mode of signal transduction. They 

20 are characterized by having an extracellular domain 
that is involved in ligand binding, and cytoplasmic 
domain which transmits an appropriate intracellular 
signal. Receptor expression patterns ultimately 
determine which cells will respond to a given ligand, 

25 while the structure of a given receptor dictates the 

cellular response induced by ligand binding. Receptors 
have been shown to transmit intracellular signals via 
their cytoplasmic domains by activating protein 
tyrosine, or protein serine/ threonine phosphorylation 

30 (e.g., platelet derived growth factor receptor (PDGFR) 
or transforming growth factor-P receptor-I (TGFpR-I) , 
by stimulating G-protein activation (e.g., P-adrenergic 
receptor) , and by modulating associations with 
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cytoplasmic signal transducing proteins (e.g., TNFR-1 
and Fas/APO) (Heldin, Cell 80. 213-223 (1995)). 

The tumor necrosis factor receptor (TNFR) 
5 superfamily is a group of type I transmembrane proteins 
which share a conserved cysteine-rich motif which is 
repeated three to six times in the extracellular domain 
(Smith, et al. Cell 76, 953-962 (1994)). Collectively, 
these repeat units form the ligand binding domains of 

10 these receptors (Chen et al.. Chemistry 270 . 2874-2878 
(1995)). The ligands for these receptors are a 
structurally related group of proteins homologous to 
TNFa, (Goeddel et al. Cold Spring Harbor Symp. Quart. 
Biol. 51, 597-609 (1986); Nagata et al. Science 267 . 

15 1449-1456 (1995)). TNFtt binds to distinct, but closely 
related receptors, TNFR-1 and TNFR-2. TNFa produces a 
variety of biological responses in receptor bearing 
cells, including, proliferation, differentiation, and 
cytotoxicity and apoptosis (Beutler et al. Ann. Rev. 

20 Biochem. 57, 505-518 (1988)). 

TNFa is believed to mediate acute and chronic 
inflammatory responses (Beutler et al . Ann. Rev. 
Biochem, 57, 505-508 (1988)), Systemic delivery of TNFa 
induces toxic shock and widespread tissue necrosis. 

25 Because of this, TNFa may be responsible for the severe 
morbidity and mortality associated with a variety of 
infectious diseases, including sepsis. Mutations in 
FasL, the ligand for the TNFR- related receptor Fas/APO 
(Suda et al. Cell 75, 1169-1178 (1993)), is associated 

30 with autoimmunity (Fisher et al. Cell 81, 935-946 

(1995)), while oveirproduction of FasL may be implicated 
in drug-induced hepatitis. Thus, ligands to the 
various TNFR-related proteins often mediate the serious 
effects of many disease states, which suggests that 

35 agents that neutralize the activity of these ligands 
would have therapeutic value. Soluble TNFR-1 
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receptors, and antibodies that bind TNFa, have been 
tested for their ability to neutralize systemic TNFa 
(Loetscher et al. Cancer Cells 3 (6K 221-226 (1991)), 
A naturally occurring form of a secreted TNFR-1 mRNA 
5 was recently cloned, and its product tested for its 
ability to neutralize TNFa activity in vitro and in 
vivo (Kohno et al. PNAS USA 87, 8331-8335 (1990)). The 
ability of this protein to neutralize TNFa suggests 
that soluble TNF receptors function to bind and clear 

10 TNF thereby blocking the cytotoxic effects on TNFR- 
bearing cells. 

An object of the invention to identify new 
members of the TNFR super family. It is anticipated 
that new family members may be transmembrane proteins 

15 or soluble forms thereof comprising extracellular 
domains and lacking transmembrane and cytoplasmic 
domains. We have identified a new member of the TNFR 
superfamily which encodes a secreted protein that is 
closely related to TNFR-2, By analogy to soluble TNFR- 

20 1, the TNFR-2 related protein may negatively regulate 
the activity of its ligand, and thus may be useful in 
the treatment of certain human diseases. 

Summary of the Invention 

25 A hovel member of the tumor necrosis factor 

receptor (TNFR) superfamily has been identified from a 
fetal rat intestinal cDNA library. A full-length cDNA 
clone was obtained and sequenced. Expression of the 
rat cDNA in a transgenic mouse revealed a marked 

30 increase in bones density, particularly in long bones, 
pelvic bone and vertebrae. The polypeptide encoded by 
the cDNA is termed Osteprotegerin (OPG) and plays a 
role in promoting bone accumulation. 

The invention provides for nucleic acids 

35 encoding a polypeptide having at least one of the 
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biological activities of OPG. Nucleic acids which 
hybridize to nucleic acids encoding mouse, rat or human 
OPG as shown in Figures 2B-2C (SEQ ID NO:120), 9A-9B 
(SEQ ID NO: 122), and 9C-9D (SEQ ID NO: 124) are also 
5 provided. Preferably, OPG is mammalian OPG and more 
preferably is human OPG, Recombinant vectors and host 
cells expressing OPG are also encompassed as are 
methods of producing recombinant OPG, Antibodies or 
fragments thereof which specifically bind the 

10 polypeptide are also disclosed. 

Methods of treating bone diseases are also 
provided by the invention. The polypeptides are useful 
for preventing bone resorption and may be used to treat 
any condition resulting in bone loss such as 

15 osteoporosis, hypercalcemia, Paget 's disease of bone, 
and bone loss due to rheumatoid arthritis or 
osteomyelitis, and the like. Bone diseases may also be 
treated with anti-sense or gene therapy using nucleic 
acids of the invention. Pharmaceutical compositions 

20 comprising OPG nucleic acids and polypeptides are also 
encompassed. 

Description of the Figures 

25 Figure 1. A. FASTA analysis of novel EST LORF. Shown 
is the deduced FRI-1 amino acid sequence aligned to the 
human TNFR-2 sequence. B. Profile analysis of the 
novel EST LORF shown is the deduced FRI-1 amino acid 
sequence aligned to the TNFR-prof ile. C. Structural 

3 0 view of TNFR superfamily indicating region which is 
homologous to the novel FRI-1. 

Figure 2 . Structure and sequence of full length rat 
OPG gene, a novel member of the TNFR superfamily, A. 
35 Map of pMOB-Bl.l insert. Box indicates position of 

LORF within the cDNA sequence (bold line) . Black box 
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indicates signal peptide, and gray ellipses indicate 
position of cysteine-rich repeat sequences* B, C, 
Nucleic acid and protein sequence of the Rat OPG cDNA. 
The predicted signal peptide is underlined, and 
5 potential sites of N-linked glycosylation are indicated 
in bold, underlined letters. D, E. Pileup sequence 
comparison (Wisconsin GCG Package, Version 8.1) of OPG 
with other members of the TNFR superfamily, fas (SEQ ID 
NO:128); tnfrl (SEQ ID NO: 129); sfu-t2 (SEQ ID 
10 NO:130); tnfr2 (SEQ ID N0:131); cd40 (SEQ ID NO:132); 

osteo (SEQ ID NO:133); ngfr (SEQ ID NO:134); ox4Q (SEQ 
ID NO:135); 41bb (SEQ ID NO:136), 

Figure 3. PepPlot analysis (Wisconsin GCG Package, 

15 Version 8.1) of the predicted rat OPG protein sequence. 
A, Schematic representation of rat OPG showing 
hydrophobic (up) and hydrophilic (down) amino acids. 
Also shown are basic (up) and acidic (down) amino 
acids. B. Display of amino acid residues that are 

20 beta-sheet forming (up) and beta-sheet breaking down) 
as defined by Chou and Fasman (Adv. Enz. 42/ 45-147 
(1948)). C. Display of propensity measures for alpha- 
helix and beta-sheet (Chou and Fasman, ibid ) , Curves 
above 1.00 show propensity for alpha-helix or beta- 

25 sheet structure. Structure may terminate in regions of 
protein where curves drop below 1.00. D. Display of 
residues that are alpha-forming (up) or alpha-breaking 
(down) . E. Display of portions of the protein sequence 
that resemble sequences typically found at the amino 

30 end of alpha and beta structures (Chou and Fasman, 

ibid) . F. Display of portions of the protein sequence 
that resemble sequences typically found at the carboxyl 
end of alpha and beta structures (Chou and Fasman, 
ibid) . G* Display of portions of the proteins sequence 

3 5 typically found in txxrns (Chou and Fasman, ibid ) H. 

Display of the helical hydrophobic moment (Eisenberg et 
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al. Proc, Natl. Acad. Sci. USA 81, 140-144 (1984)) at 
each position in the sequence. I. Display of average 
hydrophathy based upon Kyte and Doolittle (J. Mol . 
Biol. 157, 105«*132 (1982)) and Goldman et al. (reviewed 
5 in Ann. Rev. Biophys. Biophys . Chem. 15, 321-353 
(1986) ) . 

Figure 4. mRNA expression patterns for the OPG cDNA in 
human tissues. Northern blots were probed with a 32P- 
10 labeled rat cDNA insert (A, left two panels), or with 
the human cDNA insert (B, right panel) . 

Figure 5. Creation of transgenic mice expressing the 
OPG cDNA in hepatocytes. Northern blot expression of 
15 HE-OPG transgene in mouse liver. 

Figure 6, Increase in bone density in OPG transgenic 
mice. Panel A-F. Control Mice. G-J, OPG expressing 
mice. At necropsy, all animals were radiographed and 

20 photographs prepared. In A-F, the radiographs of the 
control animals and the one transgenic non-expressor 
(#28) are shown. Note that the bones have a clearly 
defined cortex and a lucent central marrow cavity. In 
contrast, the OPG (G-J) animals have a poorly defined 

25 cortex and increased density in the marrow zone. 

Figure 7 , Increase in trabecular bone in OPG 
transgenic mice. A-D. Representative photomicrographs 
of bones from control animals. In A and B, low (4X, 

3 0 lOX) power images of the femurs are shown (Masson 

Trichrome stain) . Stains for tartrate resistant acid 
phosphatase (TRAP) demonstrate osteoclasts (see arrows) 
both resorbing cartilage (C) and trabecular bone (D) . 
Note the flattened appearance of osteoclasts on 

3 5 trabecular bone. E-H* Representative 

photomicrographs of bones from OPG-expressing animals. 
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In E and F, low (4X, lOX) power images of the femurs 
are shown (Masson Trichrome stain) . The clear region is 
the growth plate cartilage, blue stained area is bone, 
and the red area is marrow. Note that in contrast to 
5 the controls, the trabecular bone has not been resorbed 
resulting in the absence of the usual marrow cavity. 
Also, the resulting trabeculae have a variegated 
appearance with blue and clear areas. The clear areas' 
are remnants of growth plate cartilage that have never 

10 been remodelled. Based on TRAP stains, these animals do 
have osteoclasts (see arrows) at the growth plate (G) , 
which may be reduced in number. However, the surfaces 
of the trabeculae away from the growth plate are 
virtually devoid of osteoclasts (H) , a finding that 

15 stands in direct contrast with the control animals .(see 
D) . 

Figure 8 . HE-OPG expressors do not have a defect in 
monocyte -macrophage development. One cause for 

20 osteopetrosis in mice is defective M-CSF production due 
to a point mutation in the M-CSF gene. This results in 
a marked deficit of circulating and tissue based 
macrophages. The peripheral blood of OPG expressors 
contained monocytes as assessed by HIE analysis. To 

25 affirm the presence of tissue macrophages, 

immnohistochemistry was performed using F480 
antibodies, which recognize a cell surface antigen on 
murine macrophages. A and C show low power (4X) 
photomicrographs of the' spleens from normal and CRl 

3 0 overexpressors . Note that both animals have numerous 
F480 positive cells. Monocyte -macrophages were also 
present in the marrow of normal (B) and HE-OPG 
overexpressors (D) {40X) . 

35 Figure 9. ' Structure and sequence of mouse and human 
OPG cDNA clones. A, B. Mouse cDNA and protein 
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sequence. C, D. Human cDNA and protein sequence. The 
predicted signal peptides are underlined, and potential 
sites of N-linked glycosylation are indicated in bold. 
E, F. Sequence alignment and comparison of rat, mouse 
5 and human OPG amino acid sequences. 

Figure 10. Comparison of conserved sequences in 
extracellular domain of TNFR-1 and human OPG. 
PrettyPlot (Wisconsin GCG Package, Version 8.1) of the 
10 TNFRl and OPG alignment described in example 6. Top 
line, h\iman TNFRl sequences encoding domains 1-4, 
Bottom line, hxaman OPG sequences encoding domains 1-4. 
Conserved residues are highlighted by rectangular 
boxes . 

15 

Figure 11. Three-dimensional representation of human 
OPG, Side-view of the Molescript display of the 
predicted 3 -dimensional structure of human OPG residues 
25 through 163, (wide line), co-crystallized with huanan 

20 TNFP (thin line) . As a reference for orientation, the 
bold arrows along the OPG polypeptide backbone are 
pointing in the N-terminal to C-terminal direction. 
The location of individual cysteine residue side chains 
are inserted along the polypeptide backbone to help 

25 demonstrate the separate cysteine-rich domains. The 
TNFP molecule is aligned as described by Banner et al . 
(1993). 

Figure 12. Structure of OPG cysteine-rich domains. 

3 0 Alignment of the human (top line SEQ ID NO: 13 6) and 
mouse (bottom line) OPG amino acid sequences 
highlighting the predicted domain structure of OPG. 
The polypeptide is divided into two halves; the N- 
terminus (A) , and C-terminus (B) . The N-terminal half 

35 is predicted to contain four cysteine rich domains 
(labeled 1-4) , The predicted intrachain disulfide 
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bonds are indicated by bold lines, labeled ^'SSl", 
"332", or "353". Tyrosine 28 and histidine 75 
(underlined) are predicted to form an ionic 
interaction. Those amino acids predicted to interact 
5 with an OPG ligand are indicated by bold dots above the 
appropriate residue. The cysteine residues located in 
the C-terminal half of OPG are indicated by rectangular 
boxes . 

10 Figure 13 . Expression and secretion of full length and 
truncated mouse OPG-Fc fusion proteins. A, Map 
indicating points of fusion to the human IgGl Fc domain 
are indicated by arrowheads. B. Silver stain of a 
SDS-polyacrylamide gel of conditioned media obtained 

15 from cells expressing either Fl.Fc (Full length OPG 

fused to Fc at Leucine 401) or CT.Fc (Carboxy- terminal 
truncated OPG fused to Fc at threonine 180) fusion 
protein expression vectors*. Lane 1, parent pCEP4 
expression vector cell line; Lane 2, Fl.Fc vector cell 

2 0 line; . Lane 3, CT.Fc vector cell line. C. Western 

blot of conditioned media obtained from Fl.Fc and CT.Fc 
fusion protein expression vectors probed with anti- 
human IgGl Fc domain (Pierce) . Lane 1, parent pCEP4 
expression vector cell line; Lane 2, Fl.Fc vector cell 
25 line; Lane "3 , CT.Fc vector cell line. 

Figure 14. Expression of human OPG in E. coli. A. 
Construction of a bacterial expression vector. The 
LORF of the human OPG gene was amplified by PGR, then 

3 0 joined to a oligonucleotide linker fragment (top strand 

is SEQ ID NO:137; bottom strand is SEQ ID NO:127), and 
ligated into pAMG21 vector DNA, The resulting vector 
is capable of expressing OPG residues 32-401 linked to 
a N-terminal methionine residue. B SDS-PAGE analysis 
35 of uninduced and induced bacterial harboring the 
pAMG21 -human OPG -32-401 plasmid. Lane 1, MW 
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standards; lane 2, uninduced bacteria; lane 3, 30°C 
induction; lane 4, 37^C induction; lane 5, whole cell 
lysate from 37°C induction; lane 6, soluble fraction of 
whole cell lysate; lane 1, insoluble fraction of whole 
5 cell lysate; lane 8, purified inclusion bodies obtained 
from whole cell lysate. 

Figure 15. Analysis of recombinant murine OPG produced 
in CHO cells by SDS-PAGE and western blotting. An 

10 equal amount of CHO conditioned media was applied to 
each lane shown, and was prepared by treatment with 
either reducing sample buffer (left lane) , or non- 
reducing sample buffer {right lane) , After 
electrophoresis, the resolved proteins were transferred 

15 to a nylon membrane, then probed with anti-OPG 

antibodies. The relative positions of the 55 kd 
monomer ic and 100 kd dimeric forms of OPG are indicated 
by arrowheads . 

20 Figure 16. Pulse-chase analysis of recombinant murine 
OPG produced in CHO cells. CHO cells were 
pulse-labeled with ^^S-methionine/cysteine/ then chased 
for the indicated time. Metabolically labeled cultures 
were separated into both conditioned media and cells, 

25 and detergent extracts wete prepared from each, 
clarified, then immunoprecipitated with anti-OPG 
antibodies. The immunoprecipitates were the resolved 
by SDS-PAGE, and exposed to film. Top left and right 
panels; samples analyzed under non-reducing conditions. 

3 0 Lower left and right panels; samples analyzed under 

reducing conditions. Top and bottom left panels; Cell 
extracts. Top and bottom right panels; Conditioned 
media extracts. The relative mobility of the 55 kd 
monomeric and 100 kd dimeric forms of OPG are indicated 

35 by arrowheads . 
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Figure 17. Expression of OPG in the CTLL-2 cell line. 
Serum- free conditioned media from CTLL-2 cells and CHO- 
mu OPG [1-401] transfected cells was prepared, 
concentrated, then analyzed by non-reducing SDS-PAGE 
5 and western blotting. Left lane; CTLL-2 conditioned 
media. Right lane; CHO-muOPG conditioned media. The 
relative mobility of the 55 kd monomeric and 100 kd 
dimeric forms of OPG are indicated by arrowheads. 

10 Figure 18. Detection of OPG expression in serum 

samples and liver extracts obtained from control and 
OPG transgenic mice. Transgenic mice were constructed 
as described in Example 4. OPG expression was 
visualized after SDS-PAGE followed by Western blotting 

15 using anti-OPG antibodies. 

Figure 19. Effects of huOPG [22-401] -Fc fusion protein 
on osteoclast formation in vitro . The osteoclast 
forming assay was performed as described in Example llA 

20 in the absence (control) or presence of the indicated 
amounts of huOPG [22-401] -Fc fusion. Osteoclast 
formation was visualized by histochemical staining for 
tartrate acid phosphatase (TRAP) . ) , A. OPG added to 
100 ng/ml. D, OPG added to 0.1 ng/ml. E. OPG added 

25 to 0.01 ng/ml. F. OPG added to 0.001 ng/ml. G. 
Control. No OPG added. 

Figure 20. Decrease in osteoclast culture TRAP 
activity with increasing amounts of OPG. Indicated 
30 concentrations of huOPG [22-401] -Fc fusion protein were 
added to osteoclast forming assay and TRAP activity 
quant itated as described in Example llA. 

Figure 21. Effect of OPG on a terminal stage of 
35 osteoclast' differentiation. ^huOPG [22-401] -Fc fusion 
was added to the osteoclast forming assay during the 
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intermediate stage of osteoclast maturation (days 5-6; 
OPG-CTL) or during the terminal stage of osteoclast 
maturation (days 7-15; CTL-OPG) . TRAP activity was 
quantitated and compared with the activity observed in 
5 the absence of OPG (CTL-CTL) in the presence of OPG 
throughout (OPG-OPG) . 

Figure 22. Effects of IL-l^, IL-la and OPG on blood - 
ionized calcium in mice. Levels of blood ionized 

10 calcium were monitored after injection of IL-ip alone, 
IL-la alone, IL-iP plus muOPG [22-401] -Fc, IL-la plus 
MuOPG [22-401J-FC, and muOPG [22-401]-Fc alone. 
Control mice received injections of phosphate buffered 
saline (PBS) only. IL-IB experiment shown in A; IL-la 

15 experiment shown in B» 

Figure 23 . Effects of OPG on calvarial osteoclasts in 
control and ILl-treated mice. Histological methods for 
analyzing mice calvarial bone samples are described in 

20 Example llB, Arrows indicate osteoclasts present in 
day 2 -treated mice. Calvarial samples of mice 
receiving four PBS injections daily (A) , one injection 
of IL-1 and three injections of PBS daily (B) , one 
injection of PBS and three injections of OPG daily (C) , 

25 one injection of IL-1 and .three injections of OPG 
daily. 

Figure 24. Radiographic analysis of bone accumulation 
in marrow cavity of normal mice. Mice were injected 
30 subcutaneous ly with saline (A) or muOPG [22-401] -Fc 
fusion (5mg/kg/d) for 14 days (B) and bone density 
determined as described in Example IIC. 

Figure 25. Histomorphometric analysis of bone 
3 5 accumulation in marrow cavity of normal mice. 
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Injection experiments and bone histology performed as 
described in Example IIC. 

Figure 26. Histology analysis of bone acciamulation in 
5 marrow cavity of normal mice. Injection experiments 
and bone histology performed as described in Example 
lie. A. Saline injection B, Injection of muOPG [22- 
401] -Fc fusion. 

10 Figure 27. Activity of OPG administered to 

ovariectomized rats. In this two week experiment the 
trend to reduced bone density appears to be blocked by 
OPG or other anti-resorptive therapies. DEXA 
measurements were taken at time of ovariectomy and at 

15 week 1 and week 2 of treatment. The results are 

expressed as % change from the initial bone density 
(Mean +/- SEM) . 

Figure 28. Bone density in the femoral metaphysis, 
20 measured by histomorphometric methods / tends to be 

lower in ovariectomized rats (OVX) than sham operated 
animals (SHAM) 17 days following ovariectomy. This 
effect was blocked by OPG-Fc, with OPG-Fc treated 
ovariectomized rats* (OVX+OPG) having significantly 
25 higher bone ^density than vehicle treated ovariectomized 
rats (OVX) . (Mean +/- SEM) . 



Detailed Description of the Invention 

3 0 A novel member of the tumor necrosis factor 

receptor (TNFR) superfamily was identified as an 
expressed sequence tag (EST) isolated from a fetal rat 
intestinal cDNA library . The structures of the full- 
length rat cDNA clones and the corresponding mouse and 

3 5 human cDNA clones were determined as described in 

Examples 1 and 6. The rat, mouse and human genes are 
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shown in Figures 2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID 
NO:122), and 9C-9D (SEQ ID NO:124), respectively. All 
three sequences showed strong similarity to the 
extracellular domains of TNFR family members. None of 
5 the full-length cDNA clones isolated encoded 

transmembrane and cytoplasmic domains that would be 
expected for membrane -bound receptors, suggesting that 
these cDNAs encode soluble, secreted proteins rather 
than cell surface receptors. A portion of the human 

10 gene spanning nucleotides 1200-1353 shown in Figure 9D 
was deposited in the Genebank database on November 22, 
1995 under accession no. 17188769. 

The tissue distribution of the rat and human 
mRNA was determined as described in Example 2. In rat, 

15 mRNA expression was detected in kidney, liver, placenta 
and heart with the highest expression in the kidney. 
Expression in skeletal muscle and pancreas was also 
detected. In humans, expression was detected in the 
same tissues along with lymph node, thymus, spleen and 

20 appendix. 

The rat cDNA was expressed in transgenic mice 
(Example 3) using the liver-specific ApoE promoter 
expression system. Analysis of expressors showed a 
marked increase in bone density, particularly in long 

25 bones (femurs), vertebrae- and flat bones (pelvis). 
Histological analysis of stained sections of bone 
showed severe osteopetrosis (see Example 4) indicating 
a marked imbalance between bone formation and 
resorption which has led to a marked accumulation of 

30 bone and cartilage. A decrease in the number of 

trabecular osteoclasts in the bones of OPG expressor 
animals indicate that a significant portion of the 
activity of the TNFR-related protein may be to prevent 
bone resorption, a process mediated by osteoclasts. In 

3 5 view of the activity in transgenic expressors, the 

TNFR-related proteins described herein are termed GPGs . 
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Using the rat cDNA sequence, mouse and hiiman 
cDNA clones were isolated (Example 5) . Expression of 
mouse OPG in 293 cells and human OPG in E, coli is 
described in Examples 7 and 8 . Mouse OPG was produced 
5 as an Fc fusion which was purified by Protein A 

affinity chromatography. Also described in Example 7 
is the expression of full-length and truncated hioman 
and mouse OPG polypeptides in CHO and 293 cells either 
as fusion polypeptides to the Fc region of human IgGl 

10 or as unfused polypeptides. The expression of full- 
length and truncated hiiman and mouse OPGs in E. coli 
either as Fc fusion polypeptides or as unfused 
polypeptides is described in Example 8. Purification 
of recombinantly produced mammalian and bacterial OPG 

15 is described in Example' 10. 

The biological activity of OPG was determined 
using an in vitro osteoclast maturation assay, an in 
vivo model of interleukin-1 (IL-l) induced 
hypercalcemia, and injection studies of bone density in 

20 normal mice (see Example 11) . The following OPG 
recombinant proteins produced in CHO or 293 cells 
demonstrated activity in the in E. coli osteoclast 
maturation assay: muOPG [22-185] -Fc, muOPG [22-194] -Fc, 
muOPG [22-401]Fc, muOPG [22-401], huOPG [22-201] -Fc, 

25 huOPG [22-4011 -Fc. muOPG - [22-180] -Fc produced in CHO 
cells and huOPG met [32-401] produced in E. coli did not 
demonstrate activity in the in vitro assay. 

OPG from several sources was produced as a 
dimer'and to some extent as a higher mul timer. Rat OPG 

30 [22-401] produced in transgenic mice, muOPG [22-401] 
and huOPG [22-401] produced as a recombinant 
polypeptide in CHO cells, and OPG expressed as a 
naturally occurring product from a cytotoxic T cell 
line were predominantly dimers and trimers when 

35 analyzed on nonr educing SDS gels (see Example 9) . 
Truncated OPG polypeptides having deletions in the 
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region of amino acids 186-401 (e.g*, OPG [1-185] and 
OPG [1-194]) were predominantly monomeric suggesting 
tliat the region 186-401 may be involved in self- 
association of OPG polypeptides. However, huOPG 
5 met [32-401] produced in E. coli was largely monomeric. 

OPG may be important in regulating bone 
resorption. Tlie protein appears to act as a soluble 
receptor of the TNF family and may prevent a receptor- 
ligand interaction involved in the osteolytic pathway. 
10 One aspect of the regulation appears to be a reduction 
in the nxamber of osteoclasts. 

Nucleic Acids 

The invention provides for an isolated 

15 nucleic acid encoding a polypeptide having at least one 
of the biological activities of OPG. As described 
herein, the biological activities of OPG include, but 
are not limited to, any activity involving bone 
metabolism and in particular, include increasing bone 

20 density. The nucleic acids of the invention are 
selected from the following: 

a) the nucleic acid sequences as shown in Figures 
2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID NO:122), and 9C-9D 
(SEQ ID NO: 124) or complementary strands thereof; 

25 b) the nucleic acids. which hybridize under 

stringent conditions with the polypeptide-encoding 
region in Figures 2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID 
NO:122), and 9C-9D (SEQ ID NO:124); and 

c) nucleic acids which hybridize under stringent 
3 0 conditions with nucleotides 148 through 337 inclusive 

as shown in Figure lA. 

d) the nucleic acid sequences which are degenerate 
to the sequences in (a) and (b) . 

The invention provides for nucleic acids 
35 which encode rat, mouse and human OPG as well as 

nucleic acid sequences hybridizing thereto which encode 



- 17 - 



a polypeptide having at least one of the biological 
activities of OPG, Also provided for are nucleic acids 
which hybridize to a rat OPG EST encompassing 
nucleotides 148-337 as shown in Figure lA. The 
5 conditions for hybridization are generally of high 
stringency such as 5xSSC, 50% formamide and 42 
described in Example 1 of the specification. 
Equivalent stringency to these conditions may be 
readily obtained by adjusting salt and organic solvent 

10 concentrations and temperature. The nucleic acids in 
(b) encompass sequences encoding OPG-related 
polypeptides which do not undergo detectable 
hybridization with other known members of the TNF 
receptor superfamily. In a preferred embodiment, the 

15 nucleic acids are as shown in Figures 2B-2C (SEQ ID 
NO:120). 9A-9B (SEQ ID NO:122), and 9C-9D (SEQ ID 
NO:124). 

The length of hybridizing nucleic acids of 
the invention may be variable since hybridization may 

20 occur in part or all of the polypeptide-encoding 

regions as shown in Figures 2B-2C. (SEQ ID NO: 120) , 9A- 
9B (SEQ ID NO:122), and 9C-9D (SEQ ID NO:124), and may 
also occur in adjacent noncoding regions. Therefore, 
hybridizing nucleic' acids may be truncations or 

25 extensions of the sequences shown in Figures 2B-2C (SEQ 
ID NO:120>, 9A-9B (SEQ ID NO:122), and 9C-9D (SEQ ID 
NO: 124). Truncated or extended nucleic acids are 
encompassed by the invention provided they retain one 
or more of the biological properties of OPG. The 

3 0 hybridizing nucleic acids may also include adjacent 
noncoding regions which are 5' and/ or 3' to the OPG 
coding region. The noncoding regions include 
regulatory regions involved in OPG expression, such as 
promoters, enhance, translational initiation sites, 

35 transcription termination sites and the like. 
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Hybridization conditions for nucleic acids 
are described in Sambrook et al. Molecular Cloning; A 
Laboratory Manual . 2nd ed. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989) 
5 DNA encoding rat OPG was provided in plasmid 

pMO-Bl.l deposited with the American Type Culture 
Collection, Rockville, MD on December 27, 1995 under 
ATCC accession no. 69970. DNA encoding mouse OPG was 
provided in plasmid pRcCMV-murine OPG deposited with 

10 the American Type Culture Collection, Rockville, MD on 
December 27, 1995 under accession no. 69971. DNA 
encoding human OPG was provided in plasmid pRcCMV - 
h-uman OPG deposited with the American Type Culture 
Collection, Rockville, MD on December 27, 1995 under 

15 accession no. 69969. The nucleic acids of the 

invention will hybridize under stringent conditions to 
the DNA inserts of ATCC accession nos. 69969, 69970, 
and 69971 and have at least one of the biological 
activities of OPG. 

20 Also provided by the invention are 

derivatives of the nucleic acid sequences as shown in 
Figures 2B, 9 A and 9B. As used herein, derivatives 
include nucleic aci^ sequences having addition, 
substitution, insertion or deletion of one or more 

25 residues such that the resulting sequences encode 
polypeptides having one or more amino acid residues 
which have been added, deleted, inserted or substituted 
and the resulting polypeptide has the activity of OPG. 
The nucleic acid derivatives may be naturally 

3 0 occurring, such as by splice variation or polymorphism, 
or may be constructed using site-directed mutagenesis 
techniques available to the skilled worker. One 
example of a naturally occurring variant of OPG is a 
nucleic acid encoding a lys to asn change at residue 3 

3 5 within the leader sequence {see Example 5) . It is 

anticipated that nucleic acid derivatives will encode 
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amino acid changes in regions of the molecule which are 
least likely to disrupt biological activity. Other 
derivatives include a nucleic acid encoding a membrane- 
bound form of OPG having an extracellular domain as 
5 shown in Figures 2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID 
NO:122), and 9C-9D (SEQ ID NO:124) along with 
transmembrane and cytoplasmic domains. 

In one embodiment, derivatives of OPG include 
nucleic acids encoding truncated forms of OPG having 

10 one or more amino acids deleted from the carboxy 

terminus. Nucleic acids encoding OPG may have from 1 
to 216 amino acids deleted from the carboxy teonninus . 
Optionally, an antibody Fc region may extend from the 
new carboxy terminus to yield a biologically active 

15 OPG-Fc fusion polypeptide, (see Example 11) . In 

preferred embodiments, nucleic acids encode OPG having 
the amino acid sequence from residues 22-185, 22-189, 
22-194 or 22-201 (using numbering in Figure 9E-F) and 
optionally, encoding an Fc region of human IgG. 

20 Also included are nucleic acids encoding 

truncated foirms of OPG having one or more amino acids 
deleted from the amino terminus. Truncated forms 
include those lacking part or all the 21 amino acids 
comprising the leader sequence. Additionally, the 

25 invention provides for nucleic acids encoding OPG 

having from 1 to 10 amino acids deleted from the mature 
amino terminus (at residue 22) and , optionally, having 
from 1 to 216 amino acids deleted from the carboxy 
terminus (at residue 401) . Optionally, the nucleic 

3 0 acids may encode a methionine residue at the amino 

terminus . Examples of such OPG truncated polypeptides 
are described in Example 8 . 

Examples of the nucleic acids of the 
invention include cDNA, genomic DNA, synthetic DNA and 

3 5 RNA. cDNA is obtained from libraries prepared from 

mRNA isolated from various tissues expressing OPG. In 
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humans, tissue sources for OPG include kidney, liver, 
placenta and heart. Genomic DNA encoding OPG is 
obtained from genomic libraries which are commercially 
available from a variety of species. Synthetic DNA is 
5 obtained by chemical synthesis of overlapping 

oligonucleotide fragments followed by assembly of the 
fragments to reconstitute part or all of the coding 
region and flanking sequences (see U*S. Patent No. 
4,695,623 describing the chemical synthesis of 

10 interferon genes) . RNA is obtained most easily by 

procaryotic expression vectors which direct high-level 
synthesis of mRNA, such as vectors using T7 promoters 
and RNA polymerase. 

Nucleic acid sequences of the invention are 

15 used for the detection of OPG sequences in biological 
samples in order to determine which cells and tissues 
are expressing OPG mRNA. The sequences may also be 
used to screen cDNA and genomic libraries for sequences 
related to OPG. Such screening is well within the 

20 capabilities of one skilled in the art using 
appropriate hybridization conditions to detect 
homologus sequences. The nucleic acids are also useful 
for modulating the expression of OPG levels by anti- 
sense therapy or gene therapy. The nucleic acids are 

25 also used for the development of transgenic animals 

which may be used for the production of the polypeptide 
and for the study of biological activity (see Example 
3) . 

3 0 Vectors and Host Cells 

Expression vectors containing nucleic acid 
sequences encoding OPG, host cells transformed with 
said vectors and methods for the production of OPG are 
also provided by the invention. An overview of 

35 expression of recombinant proteins is found in Methods 
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of Snzvmoloav v. 185; Goeddel, D.V. ed. Academic Press 
(1990) . 

Host cells for the production of OPG include 
procaryotic host cells, such as E. coli , yeast, plant, 
5 insect and mammalian host cells. E. coli strains such 
as HBlOl or JMlOl are suitable for expression. 
Preferred mammalian host cells include COS, CHOd-, 293, 
CV-1, 3T3, baby hamster kidney (BHK) cells and others. 
Mammalian host cells are preferred when post- 
10 trans lational modifications, such as glycosylation and 
polypeptide processing, are important for OPG activity. 
Mammalian expression allows for the production of 
secreted polypeptides which may be recovered from the 
growth medium. 

15 Vectors for the expression of OPG contain^ at 

a minimxim sequences required for vector propogation and 
for expression of the cloned insert. These sequences 
include a replication origin, selection marker, 
promoter, ribosome binding site, enhancer sequences, 

2 0 RNA splice sites and transcription termination site. 

Vectors suitable for expression in the aforementioned 
host cells are readily available and the nucleic acids 
of the invention are inserted into the vectors using 
standard recombinant DNA techniques* Vectors for 
25 tissue-specific expression of OPG are also included. 
Such vectors include promoters which function 
specifically in liver, kidney or other organs for 
production in mice, and viral vectors for the 
expression of OPG in targeted human cells. 

3 0 Using an appropriate host-vector system, OPG 

is produced recombinantly by culturing a host cell 
transformed with an expression vector containing 
nucleic acid sequences encoding OPG under conditions 
such that OPG is produced, and isolating the product of 
35 expression. OPG is produced, in the supernatant of 

transfected mammalian cells or in inclusion bodies of 
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transformed bacterial host cells. OPG so produced may 
be purified by procedures known to one skilled in the 
art as described below. The expression of OPG in 
mammalian and bacterial host systems is described in 
5 Examples 7 and 8. Expression vectors for mammalian 
hosts are exemplified by plasmids such as pDSRa 
described in PCT Application No. 90/14363. Expression 
vectors for bacterial host cells are exemplified by 
plasmids pAMG21 and pAMG22-His described in Example 8. 

10 Plasmid pAMG21 was deposited with the American Type 
Culture Collection, Rockville, MD on July 24, 1996 
under accession no. 98113. Plasmid pAMG22-His was 
deposited with the American Type Culture Collection, 
Rockville/ MD on July 24, 1996 under accession no. 

15 98112. It is anticipated that the specific plasmids 

and host cells described are for illustrative purposes 
and that other available plasmids and host cells could 
also be used to express the polypeptides. 

The invention also provides for expression of 

2 0 OPG from endogenous nucleic acids by in vivo or ex vivo 

recombination events to allow modulation of OPG from 
the host chromosome. Expression of OPG by the 
introduction of exogenous regulatory sequences (e.g. 
promoters or enhancers) capable of directing the 
25 production of OPG from endogenous OPG coding regions is 
also encompassed. Stimulation of endogenous regulatory 
sequences capable of directing OPG production (e.g. by 
exposure to transcriptional enhancing factors) is also 
provided by the invention. 

30 

Polypeptides 

The invention provides for OPG, a novel 
member of the TNF receptor superfamily, having an 
activity associated with bone metabolism and in 

3 5 particular- having the activity of inhibiting bone 

resorption thereby- increasing bone density. OPG refers 



- 23 - 



to a polypeptide having an amino acid sequence of 
mouse, rat or human OPG or a derivative thereof having 
at least one of the biological activities of OPG. The 
amino acid sequences of rat, mouse and human OPG are 
5 shown in Figures 2B-2C (SEQ ID NO: 121), 9A-9B (SEQ ID 
NO:123), and 9C-9D (SEQ ID NO:125) respectively. A 
derivative of OPG refers to a polypeptide having an 
addition, deletion, insertion or substitution of one or 
more amino acids such that the resulting polypeptide 

10 has at least one of the biological activities of OPG. 
The biological activities of OPG include, but are not 
limited to, activities involving bone metabolism. 
Preferably, the polypeptides will have the amino 
terminal leader sequence of 21 amino acids removed. 

15 OPG polypeptides encompassed by the invention 

include rat [1-401], rat [22-180], rat [22-401], rat 
[22-401] -Fc fusion, rat [1-180] -Fc fusion, mouse [1- 
401], mouse [1-180], mouse- [22-401] , human [1-401], 
mouse [22-180], hioman [22-401], human [22-180], human 

20 [1-180], human [22-180]-Fc fusion and human met-32-401. 
Amino acid nximbering is as shown in SEQ ID NO: 121 
(rat), SEQ ID NO:123 (mouse) and SEQ ID NO:125 (human). 
Also encompassed are polypeptide derivatives having 
deletions or carboxy-terminal truncations of part or 

25 all of amina acids residues 180-401 of OPG; one or more 
amino acid changes in residues 180-401; deletion of 
part or all of a cysteine-rich domain of OPG, in 
particular deletion of the distal (carboxy-terminal) 
cysteine-rich domain; and one or more amino acid 

3 0 changes in a cysteine-rich domain, in particular in the 
distal (carboxy-terminal) cysteine-rich domain. In one 
embodiment, OPG has from 1 to about 216 amino acids 
deleted from the carboxy terminus. In another 
embodiment, OPG has from 1 to about 10 amino acids 

3 5 deleted from the mature amino, terminus (wherein the 
mature amino terminus is at residue 22) and. 
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optionally, has from 1 to about 216 amino acids deleted 
from the carboxy terminus . 

Additional OPG polypeptides encompassed by 
the invention include the following: human [22-180] -Fc 
5 fusion, human [22-201] -Fc fusion, human [22-401] -Fc 
fusion, mouse [22-185] -Fc fusion, mouse [22-194] -Fc 
fusion. These polypeptides are produced in mammalian 
host cells, such as CHO or 293 cells. Additional OPG 
polypeptides encompassed by the invention which are 
10 expressed in procaryotic host cells include the 

following: human met [22-401] , Fc-human met [22-401] 
fusion (Fc region is fused at the amino terminus of the 
full-length OPG coding sequence as described in Example 
8), human met [22-401] -Fc fusion (Fc region fused to the 
15 full-lengh OPG sequence), Fc-mouse met [22-401] fusion, 
mouse met [22-401] -Fc fusion, human met[27-401], human 
met[22-185], human met [22-189] , human met [22-194] , 
hixman met [22-194] (P25A) , human met [22-194] (P26A) , 
human met[27-185], human met [27-189] , human met[27- 
20 194], human met-arg-gly-ser-(his)6 [22-401], human met - 
lys [22-401], hioman met- (lys) 3- [22-401] , human met[22- 
401]-Fc (P25A), hxaman met [22-401] (P25A) , human 
met [22-401] (P26A) , human met [22-401] (P26D) , mouse 
met[22-401], mouse met [27-401] , mouse met [32-401] , 
25 mouse met[27-180], mouse met[22-189], mouse met[22- 
194], mouse met [27-189] , mouse met [27-194] , mouse 
met-lys [22-401] , mouse HEK[22-401] (A45T) , mouse met- 
lys- (his) 7 [22-401] , mouse met-lys [22-401] - (his) 7 and 
mouse met [27-401] (P33E, G36S, A45P) . It is understood 
3 0 that the above OPG polypeptides produced in procaryotic 
host cells have an amino- terminal methionine residue, 
if such a residue is not indicated. In specific 
examples, OPG-Fc fusion were produced using a 227 amino 
acid region of human IgGl-yl was used having the 
35 sequence as shown in Ellison et al, (Nuc. Acids Res. 
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10 . 4071-4079 (1982)). However, variants of the Fc 
region of hioman IgG may also be used. 

Analysis of the biological activity of 
carboxy-terminal OPG truncations fused to the human 
5 IgGl Fc region indicates a portion of OPG of about 164 
amino acids which is required for activity. This 
region encompasses amino acids 22-185, preferably those 
in Figure 9C-9D (SEQ ID NO:125), and comprises four 
cysteine-rich domains characteristic of the cysteine- 

10 rich domains of TNFR extraceullular domains. 

Using the homology between OPG and the 
extracellular ligand binding domains of TNF receptor 
family members, a three-dimensional model of OPG was 
generated based upon the known crystal structure of the 

15 extracellular domain of TNFR-I (see Example 5) . This 
model was used to identify those residues within OPG 
which may be important for biological activity. 
Cysteine residues that are involved in maintaining the 
structure of the four cysteine-rich domains were 

2 0 identified. The following disulfide bonds were 

identified in the model: Domain 1: cys41 to cys54, 
cys44 to cys62, tyr23 and his 66 may act to stabilize 
the structure of this domain; Domain 2: cys65 to cys80, 
cys83 to cys98, cys87 to cysl05; Domain 3: cyslQ? to 
25 cysllS, cysl24 to cysl42; 'Domain 4: cysl45 to cysl60, 
cysl66 to cysl85. Residues were also identified which 
were in close proximity to TNF^ as shown in Figures 11 
and 12A-12B. In this model, it is assumed that OPG 
binds to a corresponding ligand; TNF^ was used as a 

3 0 model ligand to simulate the interaction of OPG with 

its ligand. Based upon this modeling, the following 
residues in OPG may be important for ligand binding: 
glu34, lys43, pro66 to gln91 (in particular, pro66, 
his68, tyr69, tyr70, thr71, asp72, ser73, his76, ser77, 
35 asp73, glu79, leu81, tyr82, pro85, val86, lys88, glu90 
and gln91) , glul53 and serl55. 
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Alterations in these amino acid residues, 
either singly or in combination, may alter the 
biological activity of OPG. For example, changes in 
specific cysteine residues may alter the structure of 
5 individual cysteine-rich domains, whereas changes in 
residues important for ligand binding may affect 
physical interactions of OPG with ligand. Structural 
models can aid in identifying analogs which have more 
desirable properties, such as enhanced biological 

10 activity, greater stability, or greater ease of 
formulation. 

The invention also provides for an OPG 
multimer comprising OPG monomers. OPG appears to be 
active as a multimer (e.g, dimer, trimer of a higher 

15 number of monomers) . Preferably, OPG multimers are. 

dimers or trimers. OPG multimers may comprise monomers 
having the amino acid sequence of OPG sufficient to ~ 
promote multimer formation or may comprise monomers 
having heterologous sequences such as an antibody Fc 

20 region. Analysis of carboxy- terminal deletions of OPG 
suggest that at least a portion of the region 186-401 
is involved in association of OPG polypeptides. 
Substitution of part or all of the region of OPG amino 
acids 186-401 with an amino acid sequence capable of 

25 self -association is also encompassed by the invention. 
Alternatively, OPG polypeptides or derivatives thereof 
may be modified to form dimers or multimers by site 
directed mutagenesis to create unpaired cysteine 
residues for interchain disulfide bond romation, by 

3 0 photochemical cross linking, such as exposure to 

ultraviolet light, or by chemical crosslinking with 
bifunctional linker molecules such as bifunctional 
polyethylene glycol and the like. 

Modifications of OPG polypeptides are 

3 5 encompassed by the invention and include post- 

translational modifications {e,g,. N-linked or O-linked 



carbohydrate chains, processing of N- terminal or 
C- terminal ends), attachment of chemical moieties to 
the amino acid backbone, chemical modifications of N- 
linked or 0-linked carbohydrate chains, and addition of 
an N-terminal methionine residue as a result of 
procaryotic host cell expression. The polypeptides may 
also be modified with a detectable label, such as an 
enzymatic, fluorescent, isotopic or affinity label to 
allow for detection and isolation of the protein. 

Further modifications of OPG include chimeric 
proteins wherein OPG is fused to a heterologous amino 
acid sequence. The heterologous sequence may be any 
sequence which allows the resulting fusion protein to 
retain the activity of OPG. The heterologous sequences 
include for example, immunoglobulin fusions, such as Fc 
fusions, which may aid in purification of the protein. 
A heterologous sequence which promotes association of 
OPG monomers to form dimers, trimers and other higher 
mul timer ic forms is preferred. 

The polypeptides of the invention are 
isolated and purified from other polypeptides present 
in tissues, cell lines and transformed host cells 
expressing OPG, or purified from components in cell 
cultures containing the secreted protein. In one 
embodiment, the polypeptide is free from association 
with other htiman proteins, such as the expression 
product of a bacterial host cell. 

Also provided by the invention are chemically 
modified derivatives of* OPG which may provide 
additional advantages such as increasing stability and 
circulating time of the polypeptide, or decreasing 
immunogenic ity (see U.S. Patent No. 4,179,337). The 
chemical moieties for derivitization may be selected 
from water soluble polymers such as polyethylene 
glycol, ethylene glycol /propylene glycol copolymers, 
carboxymethylcellulose, dextran, polyvinyl alcohol and 
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the like. The polypeptides may be modified at random 
positions within the molecule, or at predetermined 
positions within the molecule and may include one, two, 
three or more attached chemical moieties, 
5 The polymer may be of any molecular weight, 

and may be branched or unbranched. For polyethylene 
glycol, the preferred molecular weight is between about 
IkDa and about lOOkDa (the term "about" indicating that 
in preparations of polyethylene glycol, some molecules 

10 will weigh more, some less, than the stated molecular 
weight) for ease in handling and manufacturing. Other 
sizes may be used, depending on the desired therapeutic 
profile (e.g., the duration of sustained release 
desired, the effects, if any on biological activity, 

15 the ease in handling, the degree or lack of 
antigenicity and other known effects of the 
polyethylene glycol to a therapeutic protein or 
analog) . 

The polyethylene glycol molecules (or other 

20 chemical moieties) should be attached to the protein 
with consideration of effects on functional or 
antigenic domains of the protein. There are a number 
of attachment methods available to those skilled in the 
art, e.g. EP 0 401 3 84 herein incorporated by reference 

25 (coupling PEG to G-CSF) , see also Malik et al., Exp. 

Hematol, 20,: 1028-1035 (1992) (reporting pegylation of 
GM-CSF using tresyl chloride) , For example, 
polyethylene glycol may be covalently bound through 
amino* acid residues via a reactive group, such as, a 

3 0 free amino or carboxyl group. Reactive groups are 
those to which an activated polyethylene glycol 
molecule may be bound. The amino acid residues having 
a free amino group may include lysine residues and the 
N-terminal amino acid residues; those having a free 

35 carboxyl group may include aspartic acid residues 

glutamic acid residues and the C- terminal simino acid 
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residue. Sulfhydrl groups may also be used as a 
reactive group for attaching the polyethylene glycol 
molecule (s) . Preferred for therapeutic purposes is 
attachment at an amino group, such as attachment at the 
5 N- terminus or lysine group. 

One may specifically desire N-terminally 
chemically modified protein. Using polyethylene 
glycol as an illustration of the present compositions, 
one may select from a variety of polyethylene glycol 

10 molecules (by molecular weight, branching, etc.), the 
proportion of polyethylene glycol molecules to protein 
(or peptide) molecules in the reaction mix, the type of 
pegylation reaction to be performed, and the method of 
obtaining the selected N-terminally pegylated protein. 

15 The method of obtaining the N-terminally pegylated ■ 
preparation (i.e., separating this moiety from other 
monopegylated moieties if necessary) may be by 
purification of the N-terminally pegylated material 
from a population of pegylated protein molecules. 

20 Selective N-terminal chemically modification may be 
accomplished by reductive alkylation which exploits 
differential reactivity of different types of primary 
amino groups (lysine versus the N-terminal) available 
for derivatization in a particular protein. Under the 

25 appropriate reaction conditions, substantially 
selective derivatization of the protein at the 
N- terminus with a carbonyl group containing polymer is 
achieved. 

Synthetic OPG dimers may be prepared by 
30 various chemical crosslinking procedures. OPG monomers 
may be chemically linked in any fashion that retains or 
enhances the biological activity of OPG. A variety of 
chemical crosslinkers may be used depending upon which 
properties of the protein dimer are desired. For 
3 5 example, crosslinkers may be short and relatively rigid 
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or longer and more flexible, may be biologically 
reversible, and may provide reduced immunogenic ity or 
longer pharmacokinetic half -life. 

In one example, OPG molecules are linked 
5 through the amino terminus by a two step synthesis (see 
Example 12) . In the first step, OPG is chemically 
modified at the amino terminus to introduce a protected 
thiol, which after purification is deprotected and used 
as a point of attachment for site-specific conjugation 

10 through a variety of crosslinkers with a second OPG 

molecule. Amino-terminal crosslinks include, but are 
not limited to, a disulfide bond, thioether linkages 
using short-chain, bis-functional aliphatic 
crosslinkers, and thioether linkages to variable 

15 length, bifunctional polyethylene glycol crosslinkers 
(PEG ''dumbbells") . Also encompassed by PEG dumbbell 
synthesis of OPG dimers is a byproduct of such 
synthesis, termed a "monobell". An OPG monobell 
consists of a monomer coupled to a linear bifunctional 

20 PEG with a free polymer terminus. Alternatively, OPG 
may be crosslinked directly through a variety of amine 
specific homobi functional crosslinking techniques which 
include reagents such as: diethylenetriaminepentaacetic 
dianhydride (DTPA) , p-benzoquinone (pBQ) or 

25 bis (sulfosuccinimidyl) suberate (BS^) as well as others 
known in the art. It is also possible to thiolate OPG 
directly with reagents such as iminothiolane in the 
presence of a variety of bifunctional, thiol specific 
crosslinkers, such as PEG bismaleimide, and achieve 

3 0 dimerization and/or dumbbells in a one step process. 

A method for the purification of OPG from 
natural sources and from transfected host cells is also 
included. The purification process may employ one or 
more standard protein purification steps in an 

35 appropriate order to obtain pjurified protein. The 
chromatography steps can include ion exchange, gel 
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filtration, hydrophobic interaction, reverse phase, 
chromatof ocusing, affinity chromatography employing an 
anti-OPG antibody or biotin-streptavidin affinity 
complex and the like. 

5 

Antibodies 

Also encompassed by the invention are 
antibodies specifically binding to OPG, Antigens for 
the generation of antibodies may be full-length 

10 polypeptides or peptides spanning a portion of the OPG 
sequence. Immunological procedures for the generation 
of polyclonal or monoclonal antibodies reactive with 
OPG are known to one skilled in the art (see, for 
example, Harlow and Lane, Antibodies: A Laboratory 

15 Manual Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor N.Y. (1988)). Antibodies so produced are 
characterized for binding specificity and epitope 
recognition using standard enzyme-linked immunosorbent 
assays. Antibodies also include chimeric antibodies 

20 having variable and constant domain regions derived 
from different species. In one embodiment, the 
chimeric antibodies are humanized antibodies having 
murine variable domains and hxaman constant domains. 
Also encompassed are complementary determining regions 

25 grafted to a human framework (so-called CDR-grafted 

antibodies) . Chimeric and CDR-grafted antibodies are 
made by recombinant methods known to one skilled in the 
art. Also encompassed are human antibodies made in 
mice. 

3 0 Anti-OPG antibodies of the invention may be 

used as an affinity reagent to purify OPG from 
biological samples (see Example 10) . In one method, 
the antibody is immobilized on CnBr-activated Sepharose 
and a column of antibody-Sepharose conjugate is used to 

35 remove OPG- from liquid samples. Antibodies are also 
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used as diagnostic reagents to detect and quantitate 
OPG in biological samples by methods described below. 

Phanaac eu t i c a 1 c ompo s i t i on s 
5 The invention also provides for 

pharmaceutical compositions comprising a 
therapeutically effective amount of the polypeptide of 
the invention together with a pharmaceutical ly 
acceptable diluent, carrier, solubilizer, emulsifier, 

10 preservative and/or adjuvant. The term 

"therapeutically effective amount" means an amount 
which provides a therapeutic effect for a specified 
condition and route of administration. The composition 
may be in a liquid or lyophilized form and comprises a 

15 diluent (Tris, acetate or phosphate buffers) having 

various pH values and ionic strengths, solubilizer such 
as Tween or Polysorbate, carriers such as human serum 
albumin or gelatin, preservatives such as thimerosal or 
benzyl alcohol, and antioxidants such as ascrobic acid 

20 or sodium metabisulf ite . Also encompassed are 

compositions comprising OPG modified with water soluble 
polymers to increase solubility or stability. 
Compositions may also comprise incorporation of OPG 
into liposomes, microemulsions , micelles or vesicles 

25 for controlled delivery over an extended period of 
time. 

Specifically, OPG compositions may comprise 
incorporation into polymer matricies such as hydrogels, 
silicones, polyethylenes , ethylene -vinyl acetate 

30 copolymers, or biodegradable polymers. Examples of 
hydrogels include polyhydroxyalkylmethacrylates (p- 
HEMA) , polyacrylamide, polymethacrylamide, 
polyvinylpyrrolidone, polyvinyl alcohol and various 
polyelectrolyte complexes. Examples of biodegradable 

3 5 polymers include polylactic acid (PLA) , polyglycolic 
acid (PGA) , copolymers of PLA and PGA, polyamides and 
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copolymers of polyamides and polyesters. Other 
controlled release formulations include microcapsules, 
microspheres, macromolecular complexes and polymeric 
beads which may be administered by injection. 
5 Selection of a particular composition will depend 

upon a number of factors, including the condition being 
treated, the route of administration and the 
pharmacokinetic parameters desired. A more extensive 
survey of component suitable for pharmaceutical 
10 compositions is found in Remington ' s Pharmaceutical 

Sciences . 18th ed. A.R. Gennaro, ed. Mack, Easton, PA 
(1980) . 

Compositions of the invention may be 
administered by injection, either subcutaneous, 
15 intravenous or intramuscular, or by oral, nasal, 
pulmonary or rectal administration • The route of 
administration eventually chosen will depend upon a 
number of factors and may be ascertained by one skilled 
in the art. 

20 The invention also provides for 

pharmaceutical compositions comprising a 
therapeutically effective amount of the nucleic acids 
of the invention together with a pharmaceutically 
acceptable adjuvant. Nucleic acid compositions will be 

25 suitable for" the delivery "of part or all of the OPG 

coding region to cells and tissues as part of an anti- 
sense or gene therapy regimen. 

Methods of Treatment 

30 Bone tissue provides support for the body and 

consists of mineral (largely calcium and phosphorous) , 
a matrix of collagenous and noncollagenous proteins, 
and cells. Three types of cells found in bone, 
osteocytes, osteoblasts and osteoclasts, are involved 

3 5 in the dynamic process by which bone is continually 

formed and resorbed. Osteoblasts promote formation of 



bone tissue whereas osteoclasts are associated with 
resorption. Resorption, or the dissolution of bone 
matrix and mineral, is a fast and efficient process 
compared to bone formation and can release large 
amounts of mineral from bone. Osteoclasts are involved 
in the regulation of the noirmal remodeling of skeletal 
tissue and in resorption induced by hormones. For 
instance, resorption is stimulated by the secretion of 
parathyroid hormone in response to decreasing 
concentrations of calcium ion in extracellular fluids. 
In contrast, inhibition of resorption is the principal 
function of calcitonin. In addition, metabolites of 
vitamin D alter the responsiveness of bone to 
parathyroid hormone and calcitonin. 

After skeletal maturity, the amount of bone 
in the skeleton reflects the balance (or imbalance) of 
bone formation and bone resorption. Peak bone mass 
occurs after skeletal maturity prior to the fourth 
decade. Between the fourth and fifth decades, the 
equilibrixam shifts and bone resorption dominates. The 
inevitable decrease in bone mass with advancing years 
starts earlier in females than males and is distinctly 
accelerated after menopause in some females 
(principally those of Caucasian and Asian descent) . 

Osteopenia is a condition relating generally 
to any decrease in bone mass to below normal levels. 
Such a condition may arise from a decrease in the rate 
of bone synthesis or an increase in the rate of bone 
destruction or both. The most common form of 
osteopenia is primary osteoporosis, also referred to as 
postmenopausal and senile osteoporosis. This form of 
osteoporosis is a consequence of the universal loss of 
bone with age and is usually a result of increase in 
bone resorption with a normal rate of bone formation. 
About 25 to 30 percent of all. white females in the 
United States develop symptomatic osteoporosis. A 
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direct relationship exists between osteoporosis and the 
incidence of hip, femoral, neck and inter- trochanteric 
fracture in women 45 years and older. Elderly males 
develop symptomatic osteoporosis between the ages of 50 
5 and 70, but the disease primarily affects females. 

The cause of postmenopausal and senile 
osteoporosis is unknown* Several factors have been 
identified which may contribute to the condition. They 
include alteration in hormone levels accompanying aging 

10 and inadequate calcium consumption attributed to 

decreased intestinal absorption of calcium and other 
minerals. Treatments have usually included hormone 
therapy or dietary supplements in an attempt to retard 
the process. To date, however, an effective treatment 

15 for bone loss does not exist. 

The invention provides for a method of 
treating a bone disorder using a therapeutically 
effective amount of OPG. The bone disorder may be any 
disorder characterized by a net bone loss (osteopenia 

20 or osteolysis) . In general, treatment with OPG is 

anticipated when it is necessary to suppress the rate 
of bone resorption. Thus treatment may be done to 
reduce the rate of bone resorption where the resorption 
rate is above normal or to reduce bone resorption to 

25 below normal levels in order to compensate for below 
normal levels of bone formation. 

Conditions which are treatable with OPG 
include the following: 

Osteoporosis, such as primary osteoporosis, 

3 0 endocrine osteoporosis (hyperthyroidism, 

hyperparathryoidism, Gushing * s syndrome , and 
acromegaly) , hereditary and congenital forms of 
osteoporosis (osteogenesis imperfecta, homocystinuria, 
Menkes' syndrome, and Riley-Day syndrome) and 

35 osteoporosis due to immobilization of extremities. 
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Paget 's disease of bone (osteitis deformans) 
in adults and juveniles 

Osteomyelitis, or an infectious lesion in 
bone, leading to bone loss. 
5 Hypercalcemia resulting from solid tumors 

(breast, lung and kidney) and hematologic malignacies 
(multiple myeloma, lymphoma and leukemia) , idiopathic 
hypercalcemia, and hypercalcemia associated with 
hyperthryoidism and renal function disorders. 
10 Osteopenia following surgery, induced by 

steroid administration, and associated with disorders 
of the small and large intestine and with chronic 
hepatic and renal diseases. 

Osteonecrosis, or bone cell death, associated 
15 with traumatic injury or nontraumatic necrosis 

associated with Gaucher 's disease, sickle cell anemia, 
systemic lupus erythematosus and other conditions. 

Bone loss due to rheumatoid arthritis. 
Periodontal bone loss. 

2 0 Osteolytic metastasis 

It is understood that OPG may be used alone 
or in conjunction with other factors for the treatment 
of bone disorders. In one embodiment, osteoprotegerein 
is used in conjunction with a therapeutically effective 
25 amount of a factor which stimulates bone formation. 
Such factors include but are not limited to the bone 
morphogenic factors designated BMP-1 through BMP-12, 
transforming growth factor-p (TGF-p) and TGF-p family 
members, interleukin-1 dnhibitors, TNFa inhibitors, 

3 0 parathyroid hormone and analogs thereof, parathyroid 

related protein and analogs thereof, E series 
prostaglandins, bisphosphonates (such as alendronate 
and others) , and bone -enhancing minerals such as 
fluoride and calcium. 



35 
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The following examples are offered to more 
fully illustrate the invention, but are not construed 
as limiting the scope thereof. 



EXAMPLE 1 
Identification and isolation of the 
rat OPG cDNA 

Materials and methods for cDNA cloning and 
analysis are described in Maniatis et al, ibid . 
Polymerase chain reactions (PGR) were performed using a 
Perkin-Elmer 9600 thermocycler using PGR reaction 
mixture (Boehringer-Mannheim) and primer concentrations 
specified by the manufacturer. In general, 25-50 |Xl 
reactions were denatured at 94 ^G, followed by 2 0-40 
cycles of SA^C for 5 seconds, 50-60<^G for 5 seconds, 
and 72 ^G for 3-5 minutes. Reactions were the treated 
for 72 ^G for 3-5 minutes. Reactions were then 
analyzed by gel electrophoresis as described in 
Maniatis et al., ibid. 

A cDNA library was constructed using mRNA 
isolated from embryonic d20 intestine for EST analysis 
(Adams et al. Science 252, 1651-1656 (1991)). Rat 
embryos were dissected, and the entire developing small 
and large intestine removed and washed in PBS. Total 
cell RNA was purified by acid guanidinium thiocyanate- 
phenol -chloroform extraction (Ghomczynski and Sacchi 
Anal. Biochem. 162, 156-159, (1987)). The poly {A+) 
mRNA fraction was obtained from the total RNA 
preparation by adsorption to, and elution from, 
Dynabeads Oligo (dT)25 (Dynal Gorp) using the 
manufacturer's recommended procedures. A random primed 
CDNA library was prepared using the Superscript Plasmid 
System (Gibco BRL, Gaithersburg, Md) . The random cDNA 
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primer containing an internal Not I restriction site 
was used to initiate first strand synthesis and had the 
following sequence: 

5 ' -AAAGGAAGGAAAAAAGCGGCCGCTACANNITOI^ ' (SEQ ID 

5 N0:1) 

Not I 

For the first strand synthesis three separate 
reactions were assembled that contained 2,5 ^ig of 
poly (A) RNA and 120 ng, 360 ng or 1,080 ng of random 

10 primer. After second strand synthesis, the reaction 
products were separately extracted with a mixture of 
phenol :choroform:isoamyl alcohol (25:24:1 ratio), and 
then ethanol precipitated. The double strand (ds) cDNA 
products of the three reactions were combined and 

15 ligated to the following ds oligonucleotide adapter: 

5 ' -TCGACCCACGCGTCCG-3 ' ( SEQ ID NO : 2 ) 
3 ' -GGGTGCGCAGGCp-5 ' ( SEQ ID NO : 3 ) 

2 0 After ligation the cDNA was digested to 

completion with Not I, extracted with 

phenol: chloroform risoamyl (25:24:1) alcohol and ethanol 
precipitated. The resuspended cDNA was then size 
fractionated by gel filtration using premade columns 
25 provided with the Superscript Plasmid System (Gibco 
BRL, Gaithersburg, Md) as recommended by the 
manufacturer. The two fractions containing the largest 
cDNA products were pooled, ethanol precipitated and 
then directionally ligated into Not I and Sal I 

3 0 digested pMOB vector DNA (Strathmann et al, 1991) . The 

ligated cDNA was introduced into competent ElectroMAX 
DHIOB E. coli (Gibco BRL, Gaithersburg, MD) by 
electroporation. For automated sequence analysis 
approximately 10,000 transf ormants were plated on 20cm 
35 X 20cm agar plates containing ampicillin supplemented 

LB nutrient media. The colonies that arose were picked 



and arrayed onto 96 well microtiter plates containing 
200 ml of L-broth, 7.5% glycerol, and 50 )lg/ml 
ampicillin. The cultures were grown overnight at 37°C, 
a duplicate set of microtiter plates were made using a 
sterile 96 pin replicating tool, then both sets were 
stored at -SO^C for further analysis. For full-length 
cDNA cloning approximately one million transf ormants 
were plated on 96 bacterial ampicillin plates 
containing about 10,000 clones each. The plasmid DNA 
from each pool was separately isolated using the Qiagen 
Plasmid Maxi Kit (Qiagen Corp, , Germany) and arrayed 
into 96 microtiter plates for PCR analyses. 

To sequence random fetal rat intestine cDNA 
clones, glycerol stocks were thawed, and small aliquots 
diluted 1:25 in distilled. Approximately 3.0 ul of 
diluted bacterial cultures were added to PCR reaction 
mixture (Boehringer -Mannheim) containing the following 
oligonucleotides : 

5' -TGTAAAACGACGGCCAGT-3 ' (SEQ ID NO: 4) 
5 ' -CAGGAAACAGCTATGACC-3 ' { SEQ ID NO : 5 ) 

The reactions were incubated in a 
theirmocycler (Perkin-Elmer 9600) with the following 
cycle conditions: 94 C for 2 minutes; 3 0 cycles of 
94^0 for 5 seconds, 50^C for 5 seconds, and 72°C for 3 
minutes.; 72<^C for 4 minutes. After incubation in the 
thermocycler, the reactions were diluted with 2.0 mL of 
water. The amplified DNA fragments were further 
purified using Centricon columns (Princeton 
Separations) using the manufacturer's recommended 
procedures , The PCR reaction products were sequenced 
on an Applied Biosystems 373A automated DNA sequencer 
using T3 primer (oligonucleotide 3 53-23; 5^- 
CAATTAACCCTCACTAAAGG-3 ' ) (SEQ ID NO: 6) Taq dye- 
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terminator reactions (Applied Biosystems) following the 
manufacturer's recommended procedures. 

The resulting 5' nucleotide sequence obtained 
from randomly picked cDNA clones translated and then 
compared to the existing database of known protein 
sequences using a modified version of the FASTA 
program (Pearson et al, Meth. Enzymol . 183, (1990)). 
Translated sequences were also analysed for the 
presence of a specific cysteine-rich protein motif 
found in all known members of the tumor necrosis 
factor receptor (TNFR) superfamily (Smith et al , Cell 
76, 959-962 (1994)), using the sequence profile method 
of Gribskov et al , (Proc, Natl. Acad. Sci. USA 83, 
4355-4359 (1987)), as modified by Luethy et al . 
(Protein Science 3, 139-146 (1994)). 

Using the FASTA and Profile search data, an 
EST, FRI-1 (Fetal Rat Intestine-1) , was identified as a 
possible new member of the TNFR superfamily, FRI-1 
contained an approximately 600 bp insert with a LORF of 
about 150 amino acids. The closest match in the 
database was the human type II TNFR (TNFR-2) . The 
region compared showed an -43% homology between TNFR-2 
and FRI-1 over this 150 aa LORF. Profile analysis 
using the first and second cysteine-rich repeats of the 
TNFR superfamily yielded a Z score of -8, indicating 
that the FRI-1 gene possibly encodes a new family 
member . 

To deduce the structure of the FRI-1 product, the fetal 
rat intestine cDNA library was screened for full length 
clones. The following oligonucleotides were derived 
from the original FRI-1 sequence: 

5 ' -GCATTATGACCCAGAAACCGGAC-3 * ( SEQ ID NO : 7 ) 
5 ' -AGGTAGCGCCCTTCCTCACATTC-3 ' ( SEQ ID NO : 8 ) 
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These primers were used in PGR reactions to 
screen 96 pools of plasmid DNA, each pool containing 
plasmid DNA from 10,000 independent cDNA clones. 
Approximately 1 ug of plasmid pool DNA was amplified in 
5 a PGR reaction mixture (Boehringer-Mannheim) using a 
Perkin-Elmer 96 well thermal cycler with the following 
cycle conditions: 2 min at 94^C,1 cycle; 15 sec at 
94^C, then 45 sec at 65°C, 30 cycles; 7 min at 65<=»G; 1 
cycle. PGR reaction products were analysed by gel 

10 electrophoresis. 13 out of 96 plasmid DNA pools gave 
rise to amplified DNA products with the expected 
relative molecular mass. 

DNA from one positive pool was used to 
transform competent ElectroMAX DHIOB E. coli (Gibco 

15 BRL, Gaithersburg, MD) as described above. 

Approximately 40,000 transf ormants were plated onto 
sterile nitrocellulose filters {BA-85, Schleicher and 
Schuell) , and then screened by colony hybridization 
using a 32p_dGTP labelled version of the PGR product 

2 0 obtained above. Filters were prehybridized in 5X SSG, 

50% deionized formamide, 5X Denhardt's solution, 0.5% 
SDS, and 100 ug/ml denatured salmon sperm DNA for 2-4 
hours at 42 °G, Filters were then hybridized in 5X SSG, 
50% deionized formamide, 2X Denhardt's solution, 0.1% 
25 SDS, 100 fxg/ml denatured salmon sperm DNA, and -5 ng/ml 
of labelled probe for -18 hours at 42 °G. The filters 
were then washed in 2X SSG for 10 min at RT, IX SSG for 
10 min at SS^C, and finally in 0.5X SSG for 10-15 min 
at 55^G. Hybridizing clones were detected following 

3 0 autoradiography, and then replated onto nitrocellulose 

filt ers for secondary screening. Upon secondary 
screening, a plasmid clone (pBl.l) was isolated, then 
amplified in L-broth media containing 100 ug/ml 
ampicillin and the plasmid DNA obtained. Both strands 
3 5 of the 2.4 kb pBl.l insert were sequenced. 
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The pBl.l insert sequence was used for a 
FASTA search of the public database to detect any 
existing sequence matches and/or similarities. No 
matches to any known genes or EST's were found, 
5 although there was an approximate 45% similarity to the 
human and mouse TNFR-2 genes. A methionine start codon 
is found at bp 124 of the nucleotide sequence, followed 
by a LORF encoding 401 aa residues that terminates at 
bp 1327. The 401 aa residue product is predicted to 
10 have a hydrophobic signal peptide of approximately 31 

residues at its N- terminus, and 4 potential sites of N- 
linked glycosylation . No hydrophobic transmembrane 
spanning sequence was identified using the PepPlot 
program (Wisconsin GCG package, version 8.1). The 
15 deduced 401 aa sequence was then used to search the 
protein database. Again, there were no existing 
matches, although there appeared to be a strong 
similarity to many members of the TNFR superfamily, 
most notably the hiiman and mouse TNFR-2 . A sequence 
20 alignment of this novel protein with known members of 
the TNFR- superfamily was prepared using the Pileup 
program, and then modified by PrettyPlot (Wisconsin GCG 
package, version 8.1). This alignment shows a clear 
homology between the full length FRI-1 gene product and 
25 all other TNFR family members. The homologus region 
maps to the extracellular domain of TNFR family 
members, and corresponds to the three or four cysteine- 
rich repeats found in the ligand binding domain of 
these proteins. This suggested that the FRI-1 gene 
30 encoded a novel TNFR family member. Since no 

transmembrane spanning region was detected we predicted 
that this may be a secreted receptor, similar to TNFR-1 
derived soluble receptors (Kohno et al. Proc. Natl. 
Acad. Sci. USA 87, 8331-8335 (1990)). Due to the 
3 5 apparent biological activity of the FRI-1 gene (vide 
infra) , the product was named Osteoprotegerin (OPG) . 
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EXAMPLE 2 

OPG iriRNA Expression Patterns in Tissues 

5 

Multiple human tissue northern blots 
(Clonetech) were probed with a ^^p.^^rpp labelled FRl-1 
PGR product to detect the size of the human transcript 
and to determine patterns of expression. Northern 

10 blots were prehybridized in 5X SSPE, 50% formamide, 5X 
Denhardt's solution, 0.5% SDS, and 100 |Xg/ml denatured 
salmon sperm DNA for 2-4 hr at 42 ^C. The blots were 
then hybridized in 5X SSPE, 50% formamide, 2X 
Denhardt's solution, 0.1% SDS, 100 |Xg/ml denatured 

15 salmon sperm DNA, and 5 ng/ml labelled probe for 18-24 
hr at 42 The blots were then washed in 2X BSC for 
10 min at RT, IX SSC for 10 min at 50^C, then in 0 . 5X 
BSC for 10-15 min. 

Using a probe derived from the rat gene, a 

20 predominant mRNA species with a relative molecular mass 
of about 2.4 kb is detected in several tissues, 
including kidney, liver, placenta, and heart. Highest 
levels are detected in the kidney. A large mRNA 
species of Mr 4.5 and 7 . 5 kb was detected in skeletal 

25 muscle and pancreas. In human fetal tissue, kidney was 
found to express relatively high levels of the 2,4 kb 
mRNA, Using a human probe (vide infra), only the 2.4 
kb transcript is detected in these same tissues. In 
addition, relatively high levels of the 2.4 kb 

3 0 transcript was detected in the lymph node, thymus, 
spleen and appendix. The size of the transcript 
detected by both the rat and human Osteosprotegerin 
gene is almost identical to the length of the rat pBl . 1 
FRI-1 insert, suggesting it was a full length cDNA 

3 5 clone. 
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Systemic delivery of OPG in transgenic mice 

The rat OPG clone pBl . 1 was used as template 
5 to PGR amplify the coding region for subcloning into an 
ApoE-liver specific expression vector (Simonet et al . 
J. Clin. Invest. 94, 1310-1319 (1994), and PCT 
Application No. US94/11575 and co-owned U.S. Serial No. 
08/221,767. The following 5' and 3' oligonucleotide 
10 primers were used for PGR amplification, respectively: 

5 ' -GACTAGTCGCACAATGAACAAGTGGCTGTG-3 * (SEQ ID NO: 9) 
5 ' -ATAAGAATGCGGCCGCTAAACTATGAAACAGCCCAGTGACCATTC-3 ' 
{SEQ ID NO: 10) 

15 

The PGR reaction mixture (Boehringer- 
Mannheim) was treated as follows: 94^G for 1 minute, 1 
cycle; 94^G for 20 sec, 62°G for 30 sec, and 74 G for 1 
minute, 25 cycles. Following amplification, the 

20 samples were purified over Qiagen PGR columns and 
digested overnight with Spel and NotI restriction 
enzymes. The digested products were extracted and 
precipitated and subcloned into the ApoE promoter 
expression vector. Prior to microinjecting the 

25 resulting clone, HE-OPG, it was sequenced to ensure it 
was mutation-free. 

The HE-OPG plasmid was purified through two 
rounds of CsGl density gradient centrif ugation. The 
purified plasmid DNA was digested with Xhol and Ase I, 

3 0 and the 3.6 kb transgene insert was purified by gel 

electrophoresis. The purified fragment was diluted to 
a stock injection solution of 1 |ig/ml in 5 mM Tris, pH 
7.4, 0.2 mM EDTA. Single-cell embryos from BDFl x 
BDFl-bred mice were injected essentially as described 

35 (Brinster et al , , Proc , Natl.. Acad. Sci. USA 82, 4338 

(1985)), except that injection needles were beveled and 
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siliconized before use. Embryos were cultured 
overnight in a CO2 incubator and 15 to 20 2 -cell 

embryos were transferred to the oviducts of 
pseudopregnant GDI female mice. 
5 Following term pregnancy, 49 offspring were 

obtained from implantation of microinjected embryos. 
The offspring were screened by PGR amplification of the 
integrated transgene in genomic DNA samples . The target 
region for amplification was a 3 69 bp region of the 
10 human Apo E intron which was included in the expression 
vector. The oligos used for PGR amplification were: 

5'- GGC TGT AGA AAG AGG TGG GAG-3 ' (SEQ ID N0:11) 
5'- GGG GGT GTT GCA TTT ATG AGG-3 ' . (SEQ ID NO: 12) 

15 

The conditions for PGR were: 94^G for 2 
minute, 1 cycle; 94°G for 1 min, 63°G for 20 sec, and 
72 °G for 3 0 sec, 3 0 cycles. Of the 49 original 
offspring, 9 were identified as PGR positive transgenic 
20 founders. 

At 8-10 weeks of age, five transgenic 
founders (2, 11, 16, 17, and 28) and five controls ( 1, 
12, 15, 18, and 30) were sacrificed for necropsy and 
pathological analysis , Liver was isolated from the 

25 remaining 4 founders by partial hepatectomy. For 

partial hepatectomy, the mice were anesthetized and a 
lobe of liver was surgically removed. Total cellular 
RNA was isolated from livers of all transgenic 
founders, and 5 negative control litteinfnates as 

30 described {McDonald et al . Meth. Enzymol. 152, 219 

(1987)) . Northern blot analysis was performed on these 
samples to assess the level of transgene expression. 
Approximately lOug of total RNA from each animal liver 
was resolved by electrophoresis denaturing gels (Ogden 

35 et al. Meth. Enzymol 152 . 61 (1987)), then transferred 
to HYBOND-N nylon membrane (Amersham) , and probed with 
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dCTP-labelled pBl.l insert DNA. Hybridization was 
performed overnight at 42 in 50% Formamide, 5 x SSPE, 
0.5% SDS, 5 X Denhardt's solution, 100 |Xg/ml denatured 
salmon sperm DNA and 2-4 x 10^ cpm of labeled probe/ml 
of hybridization buffer. Following hybridization, 
blots were washed twice in 2 x SSC, 0.1% SDS at room 
temperature for 5 min each, and then twice in 0 . 1 x 
SSC, 0,1% SDS at 55<>C for 5-10 min each. Expression of 
the transgene in founder and control littermates was 
determined following autoradiography. 

The northern blot data indicate that 7 of the 
transgenic founders express detectable levels of the 
transgene mRNA (animal #'s 2, 11, 16, 17, 22, 33, and 45). 
The negative control mice and one of the founders (#28) 
expressed no transgene-related mRNA. Since OPG is 
predicted to be a secreted protein, overexpression of 
transgene mRNA should be a proxy for the level of 
systemically delivered gene product. Of the PCR and 
northern blot positive mice, animal 2, 17 and 22 
expressed the highest levels of transgene mRNA, and may 
show more extensive biological effects on host cells 
and tissues. 



EXAMPLE 4 
Biological activity of OPG 

Five of the transgenic mice (animals 
2,11,16,17 and 28) and 5 control littermates (animals 
1,12,15,18, and 30) were sacrificed for necropsy and 
pathological analysis using the following procedures: 
Prior to euthanasia, all animals had their 
identification numbers verified, then were weighed, 
anesthetized and blood drawn. The blood was saved as 
both serum and whole blood for a complete serum 
chemistry and hematology panel. Radiography was 
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performed just after terminal anesthesia by lethal C02 
inhalation, and prior to the gross dissection. 
Following this, tissues were removed and fixed in 10% 
buffered Zn-Formalin for histological examination. The 
tissues collected included the liver, spleen, pancreas, 
stomach, duodenum, ilexim, colon, kidney, reproductive 
organs, skin and mammary glands, bone, brain, heart, 
lung, thymus, trachea, eosphagus, thyroid, jejunem, 
cecum, rectum, adrenals, urinary bladder, and skeletal 
muscle. Prior to fixation the whole organ weights were 
determined for the liver, stomach, kidney, adrenals, 
spleen, and thymus. After fixation the tissues were 
processed into paraffin blocks, and 3 um sections were 
obtained. Bone tissue was decalcified using a formic 
acid solution, and all sections were stained with 
hematoxylin and eosin. In addition, staining with 
Gomori's reticulin and Masson's trichrome were 
performed on certain tissues. Enzyme histochemistry 
was performed to determine the expression of tartrate 
resistant acid phosphatase (TRAP) , an enyzme highly 
expressed by osteoclasts, multinucleated bone-resorbing 
cells of monocyte -macrophage lineage. 
Immunohistochemistry for BrdU and F480 monocyte- 
macrophage surface antigen was also performed to detect 
replicating. cells and cells of the monocyte-macrophage 
lineage, respectively. To detect F480 surface antigen 
expression, formalin fixed, paraffin embedded 4pm 
sections were deparaf f inized and hydrated to deionized 
water. The sections were quenched with 3% hydrogen 
peroxide, blocked with Protein Block (Lipshaw, 
Pittsburgh, PA) , and incubated in rat monoclonal anti- 
mouse F480 (Harlan, Indianapolis, IN) . This antibody 
was detected by biotinylated rabbit anti-rat 
immunoglobulins, peroxidase conjugated strepavidin 
(BioGenex San Ramon, CA) with DAB as chromagen (BioTek, 
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Santa Barbara, CA) . Sections were counterstained with 
hematoxylin. 

Upon gross dissection and observation of 
visceral tissues, no abnormalities were found in the 
5 transgene expressors or control littermates. Analysis 
of organ weight indicate that spleen size increased by 
approximately 38% in the transgenic mice relative to 
controls. There was a slight enlargement of platelet 
size and increased circulating unstained cells in the 

10 transgene expressors. There was a marginal decrease in 
platelet levels in the transgene expressors. In 
addition, the serum uric acid, urea nitrogen, and 
alkaline phosphatase levels all trended lower in the 
transgene expressors . The expressors were found to 

15 have increased radiodensity of the skeleton, including 
long bones (femurs), vertebrae, and flat bones 
(pelvis) , The relative size of femurs in the 
expressors were not different from the the control 
mice . 

2 0 Histological analysis of stained sections of 

bone from the OPG expressors show severe osteopetrosis 
with the presence of cartilage remnants from the 
primary spongiosa seen within bone trabeculae in the 
diaphysis of the femur. A clearly defined cortex was 
25 not identifiable in the sections of femur. In normal 
animals, the central diaphysis is filled with bone 
marrow. Sections of vertebra also show osteopetrotic 
changes implying that the OPG-induced skeletal changes 
were systemic. The residual bone marrow showed 

3 0 predominantly myeloid elements. Megakaryocytes were 

present. Reticulin stains showed no evidence for 
reticulin deposition. Immunohistochemistry for F480, a 
cell surface antigen expressed by cells of monocyte- 
macrophage derivation in the mouse, showed the presence 
35 of F480 positive cells in the marrow spaces. Focally, 
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flattened F480 positive cells could be seen directly 
adjacent to trabecular bone surfaces. 

The mesenchymal cells lining the bony 
trabeculae were flattened and appeared inactive. Based 
on KScE and TRAP stains, osteoclasts were rarely found 
on the trabecular bone surfaces in the OPG expressors. 
In contrast, osteoclasts and/or chondroclasts were seen 
in the region of the growth plate resorbing cartilage, 
but their numbers may be reduced compared to controls. 
Also, osteoclasts were present on the cortical surface 
of the metaphysis where modelling activity is usually 
robust. The predominant difference between the 
expressors and controls was the profound decrease in 
trabecular osteoclasts, both in the vertebrae and 
femurs. The extent of bone accumulation was directly 
correlated with the level of OPG transgene mRNA 
detected by northern blotting of total liver RNA. 

The spleens from the OPG expressors had an 
increased amount of red pulp with the expansion due to 
increased hematopoiesis . All hematopoietic lineages are 
represented. F480 positive cells were present in both 
control and OPG expressors in the red pulp. Two of the 
expressors (2 and 17) had foci of extramedullary 
hematopoiesis within the liver and this is likely due 
to the osteopetrotic marrow. 

There were no observable abnormalities in 
the thymus, lymph nodes, gastrointestinal tract, 
pancreato-hepatobiliary tract, respiratory tract, 
reproductive system, genito-urinary system, skin, 
nervous system, heart and aorta, breast, skeletal 
muscle and fat. 



EXAMPLE 5 

Isolation of mouse and human OPG cDNA 



A cDNA clone corresponding to the 5 ' end of 
the mouse OPG mRNA was isolated from a mouse kidney 
CDNA library (Clontech) by PGR amplification. The 
oligonucleotides were derived from the rat OPG cDNA 
sequence and are shown below: 

5 ' -ATCAAAGGCAGGGCATACTTCCTG-3 ' ( SEQ ID NO : 13 ) 
5 • -GTTGCACTCCTGTTTCACGGTCTG-3 ' (SEQ ID NO: 14} 

5 ' -CAAGACACCTTGAAGGGCCTGATG-3 ' { SEQ ID NO : 15 ) 
5 • -TAACTTTTACAGAAGAGCATCAGC-3 * ( SEQ ID NO : 16 ) 

5 ' -AGCGCGGCCGCATGAACAAGTGGCTGTGCTGCG-3 ' (SEQ ID NO : 17 ) 
5 ' -AGCTCTAGAGAAACAGCCCAGTGACCATTCC-3 ' ( SEQ ID NO : 18 )^ 

The partial and full-length cDNA products 
obtained in this process were sequenced. The full- 
length product was digested with Not* I and Xba I, then 
directionally cloned into the plasmid vector pRcCMV 
(Invitrogen) . The resulting plasmid was named pRcCMV- 
Mu-OPG, The nucleotide sequence of the cloned product 
was compared to the rat OPG cDNA sequence. Over the 
13 00 bp region spanning the OPG LORF, the rat and mouse 
DNA sequences are approximately 88% identical. The 
mouse cDNA sequence contained a 401 aa LORF, which was 
compared to the rat OPG protein sequence and found to 
be --94% identical without gaps. This indicates that 
the mouse cDNA sequence isolated encodes the murine OPG 
protein, and that the sequence and structure has been 
highly conserved throughout evolution. The mouse OPG 
protein sequence contains an identical putative signal 
peptide at its N- terminus, and all 4 potential sites of 
N-linked glycosylation are conserved. 
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A partial h\aman OPG cDNA was cloned from a 
human kidney cDNA library using the following rat- 
specific oligonucleotides : 

5 5'-GTG AAG CTG TGC AAG AAC CTG ATG-3 ' 

(SEQ ID NO: 19) 

5'-ATC AAA GGC AGG GCA TAG TTC CTG-3 ' 
(SEQ ID NO: 20} 

10 This PGR product was sequenced and used to 

design primers for amplifying the 3 ' end of the human 
cDNA using a human OPG genomic clone in lambda as 
template: 

15 5 ' -TCCGTAAGAAACAGCCCAGTGACC-3 ' (SEQ ID NO: 29) 
5 ' -CAGATCCTGAAGCTGCTCAGTTTG-3 ' (SEQ ID NO: 21) 

The amplified PGR product was sequenced, and 
together with the 5* end sequence, was used to design 
20 5' and 3' human-specific primers useful for amplifying 
the entire human OPG cDNA coding sequences : 

5 ' -AGCGCGGCCGCGGGGACCACAATGAACAAGTTG-3 ' (SEQ ID NO: 22) 
5' -AGCTCTAGAATTGTGAGGAAACAGCTCAATGGC-3 ' (SEQ ID NO: 23) 

25 

The full-length human PGR product was 
sequenced, then directionally cloned into the plasmid 
vector pRcCMV (Invitrogen) using Not I and Xba I. The 
resulting plasmid was named pRcCMV-htiman OPG. The 

3 0 nucleotide sequence of the cloned product was compared 
to the rat and mouse OPG cDNA sequences. Over the 13 00 
bp region spanning the OPG LORE, the rat and mouse DNA 
sequences are approximately 78-88% identical to the 
human OPG cDNA. The human OPG cDNA sequence also 

35 contained a 401 aa LORE, and it was compared to the rat 
and mouse protein sequences. The predicted hioman OPG 
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protein is approximatlely 85% identical, and --90% 
identical to the rat and mouse proteins, respectively. 
Sequence alignment of rat, mouse and human proteins 
show that they have been highly conserved during 
evolution. The human protein is predicted to have a N- 
terminal signal peptide, and 5 potential sites of N- 
linked glycosylation, 4 of which are conserved between 
the rat and mouse OPG proteins . 

The DNA and predicted amino acid sequence of 
mouse OPG is shown in Figure 9A and 9B {SEQ ID NO: 122) . 
The DNA and predicted amino acid sequence of human OPG 
is shown in Figure 9C an 9D (SEQ ID NO: 124) . A 
comparison of the rat, mouse and human OPG amino acid 
sequences is shown in Figure 9E and 9F. 

Isolation of additional human OPG cDNA clones 
revealed the presence of a G to C base change at 
position 103 of the DNA sequence shown in Figure 9C . 
This nucleotide change results in substitution of an 
asparagine for a lysine at position 3 of the amino acid 
sequence shown in Figure 9C. The remainder of the 
sequence in clones having this change was identical to 
that in Figure 9C and 9D, 

EXAMPLE 6 

OPG three-dimensional structure modelling 

The amino -terminal portion of OPG has 
homology to the extracellular portion of all known 
members of the TNFR superfamily (Figure IC) . The most 
notable motif in this region of TNFR-related genes is 
an --40 amino acid, cysteine-rich repeat sequence which 
folds into distinct structures (Banner et al. Cell 73., 
431-445 (1993)). This motif is usually displayed in 
four (range 3-6) tandem repeats (see Figure IC) , and is 
known to be involved in ligand binding (Beutler and van 
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Huffel Science 264. 667-663 (1994)). Each repeat 
usually contains six interspaced cysteine residues, 
which are involved in forming three intradomain 
disulfide bonds, termed SSI, SS2, and SS3 (Banner et 
al., ibid). in some receptors, such as TNFR2, CD3 0 and 
CD40, some of the repeat domains contain only two 
intrachain disulfide bonds (SSI and SS3) . 

The hximan OPG protein sequence was aligned to 
a TNFRl extracellular domain profile using methods 
described by Luethy, et al . , ibid, and the results were 
graphically displayed using the PrettyPlot program from 
the Wisconsin Package, version 8,1 (Genetics Computer 
Group, Madison, WI) (Figure 10). The alignment 
indicates a clear conservation of cysteine residues 
involved in formation of domains 1-4. This alignment 
was then used to construct a three-dimensional (3-D) 
model of the human OPG N-terminal domain using the 
known 3-D structure of the extracellular domain of p55 
TNFRl (Banner et al . , ibid) as the template. To do this 
the atomic coordinates of the peptide backbone and side 
chains of identical residues were copied from the 
crystal structure coordinates of TNFRl. Following 
this, the remaining coordinates for the insertions and 
different side chains were generated using the LOOK 
program (Molecular Applications Group, Palo Alto, CA) . 
The 3-D model was then refined by minimizing its 
conformational energy using LOOK. 

By analogy with other TNFR family members, it 
is assumed that OPG binds to a ligand. For the purpose 
of modelling the interaction of OPG with its ligand, 
the crystal structure of TNF-|3 was used to simulate a 
3-D representation of an "OPG ligand". This data was 
graphically displayed (see Figure 11) using Molscript 
(Kraulis, J. Appl . Cryst. 24/ 946-950, 1991). A model 
for the OPG/ ligand complex with 3 TNFp and 3 OPG 
molecules was constructed where the relative positions 
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of OPG are identical to TNFRl in the crystal structure. 
This model was then used to find the residues of OPG 
that could interact with its ligand using the following 
approach: The solvent accessible area of all residues 
5 in the complex and one single OPG model were 
calculated. The residues that have different 
accessibility in the complex than in the monomer are 
likely to interact with the ligand. 

The human and mouse OPG amino acid sequences 
10 were realigned using this information to highlight 

sequences comprising each of the cysteine rich domains 
1-4 (Figure 12A and 12B) . Each domain has individual 
structural characteristics which can be predicted: 

15 Domain 1 

Contains 4 cysteines involved in SS2 (C41 to 
C54) and SS3 (C44 to C62) disulfide bonds. Although no 
SSI bond is evident based on disulfide bridges, the 
conserved tyrosine at position 28 is homologous to Y2 0 

2 0 in TNFRl, which is known to be involved in interacting 
with H66 to aid in domain formation. OPG has a 
homologous histidine at position 15, suggesting OPG Y28 
and H75 stack together in the native protein, as do the 
homologous residues in TNFRl. Therefore, both of these 

25 residues may indeed be important for biological 

activity, and N- terminal OPG truncations up to and 
beyond Y28 may have altered activity. In addition, 
residues E34 and K43 are predicted to interact with a 
bound -ligand based on our 3-dimensional model. 

30 

Domain 2 

Contains six cysteines and is predicted to 
contain SSI {C65 to C80), SS2 (C83 to C98) and SS3 {C87 
to C105) disulfide bonds. This region of OPG also 
35 contains an region stretching from P66-Q91 which aligns 
to the portion of TNFRl domain 2 which forms close 
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contacts with TNFp (see above), and may interact with 
an OPG ligand. In particular residues P66, H68, Y69, 
Y70, T71, D72, S73, H75, T76, S77, D78, E79, L81, Y82, 
P85, V86, K88, E89, L90, and Q91 are predicted to 
interact with a bound ligand based on our structural 
data. 

Domain 3 

Contains 4 cysteines involved in SSI (C107 to 
C 118) and SS3 {C124 to C142) disulfide bonds, but not 
an SS2 bond. Based on our structural data, residues 
E115, L118 and K119 are predicted in to interact with 
an OPG ligand. 

Domain 4 

Contains 4 cysteines involved in SSI (C145 to 
C160) and SS3 (C166 to C185) disulfide bonds, but not 
an SS2 bond, similar to domain 3. Our structural data 
predict that E153 and S155 interact with an OPG ligand. 

Thus, the predicted structural model for OPG 
identifies a number of highly conserved residues which 
are likely to be important for its biological activity. 

EXAMPLE 7 
Production of recombinant secreted 
OPG protein in mammalian cells 

To determine if OPG is actually a secreted 
protein, mouse OPG cDNA was fused to the human IgGl Fc 
domain as a tag (Capon et al . Nature 337, 525-531 
(1989)), and expressed in human 293 fibroblasts. Fc 
fusions were carried out using the vector pFc-A3 . pFc- 
A3 contains the region encoding the Fc portion of human 
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immunoglobulin IgG-yl heavy chain (Ellison et al, ibid ) 
from the first amino acid of the hinge domain (Glu-99) 
to the carboxyl terminus and is flanked by a 5 ' -NotI 
fusion site and 3 ' -Sail and Xbal sites. The plasmid 
5 was constructed by PGR amplification of the human 

spleen cDNA library (Clontech) . PGR reactions were in a 
final vol\ime of 100 \il and employed 2 units of Vent DNA 
polymerase (New England Biolabs) in 20 mM Tris-HCl (pH 
8.8), 10 mM KCl, 10 \m (NH4)2S04, 2 mM MgS04, 0.1% 

10 Triton X-100 with 400 jlM each dNTP and 1 ng of the cDNA 
library to be amplified together with 1 jiM of each 
primer. Reactions were initiated by denaturation at 
95^C for 2 min, followed by 30 cycles of 95^G for 30 s, 
55 ^G for 3 0 s, and 73 °C for 2 min. The 5' primer 

15 5' ATAGCGGGGGGTGAGGGGAAATGTTGTGACAAAACTCAG 3* (SEQ 

ID NO: 24) 

incorporated a NotI site immediately 5' to the first 
residue (Glu-99) of the hinge domain of IgG-yl. The 3' 
primer 

2 0 5 ' -TCTAGAGTCGACTTATGATTTACCCGGAGACAGGGAGAGGCTCTT - 

3 ' (SEQ ID NO: 25) 

incorporated Sail and Xbal sites. The 717-bp PGR 
product was digested with NotI and Sail, isolated by 
electrophoresis through 1% agarose (FMC Gorp. ), purified 
25 by the Geneclean procedure (BIO 101, Inc.) and cloned 
into NotI, Sall-digested pBluescript II KS vector 
(Stratagene) . The insert in the resulting plasmid, pFc- 
A3, was sequenced to confirm the fidelity of the PGR 
reaction. 

30 The cloned mouse cDNA in plasmid pRcGMV-MuOPG 

was amplified using the following two sets of primer 
pairs : 

Pair 1 
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5 ' -CCTCTGAGCTCAAGCTTCCGAGGACCACAATGAACAAG-3 * (SEQ ID 
NO: 26) 

5 ' -CCTCTGCGGCCGCTAAGCAGCTTATTTTCACGGATTGAACCTG-3 ' ( SEQ 
ID NO: 27) 



Pair 2 

5 ' -CCTCTGAGCTCAAGCTTCCGAGGACCACAATGAACAAG-3 ' (SEQ ID 
NO: 28) 

10 5 ' -CCTCTGCGGCCGCTGTTGCATTTCCTTTCTG-3 ' { SEQ ID NO : 3 0 ) 

The first pair amplifies the entire OPG LORF, 
and creates a NotI restriction site which is compatible 
with the in-frame Not I site in Fc fusion vector pFcA3 . 

15 pFcA3 was prepared by engineering a NotI restriction 
site 5' to aspartic acid reside 216 of the human IgGl 
Fc cDNA, This construct introduces a linker which 
encodes two irrelevant amino acids which span the 
junction between the OPG protein and the IgG Fc region. 

20 This product, when linked to the Fc portion, would 

encode all 401 OPG residues directly followed by all 
227 amino acid residues of the human IgGl Fc region 
(Fl.Fc) . The second primer pair amplifies the DNA 
sequences encoding the first 180 amino acid residues of 

25 OPG, which encompasses its putative ligand binding 

domain. As above, the 3' primer creates an artificial 
Not I restriction site which fuses the C-terminal 
truncated OPG LORF at position threonine 180 directly 
to the IgGl Fc domain (CT.fc), 

3 0 The amino acid sequence junction linking OPG 

residue 401 and aseptic acid residue 221 of the human 
Fc region can be modified as follows: The DNA encoding 
residues 216-220 of the human Fc region can be deleted 
as described below, or the cysteine residue 

3 5 corresponding to C22 0 of the human Fc region can be 
mutated to either serine or alanine. OPF-Fc fusion 



- 58 - 



protein encoded by these modi fed vectors can be 
transfected into h\iman 2 93 cells, or CHO cells, and 
recombinant OPG-Fc fusion protein purified as described 
below. 

Both products were directionally cloned into 
the plasmid vector pCEP4 (Invitrogen) . pCEP4 contains 
the Epstein-Barr virus origin of replication, and is 
capable of episomal replication in 293-EBNA-l cells. 
The parent pCEP4, and pCEP4-Fl.Fc and pCEP4-CT.Fc 
vectors were lipofected into 293-EBNA-l cells using the 
manufacturer's recommended methods. The transfected 
cells were then selected in 100 fig/ml hygromycin to 
select for vector expression, and the resulting drug- 
resistant mass cultures were grown to confluence. The 
cells were then cultured in serum- free media for 72 hr, 
and the conditioned media removed and analysed by SDS- 
PAGE. A silver staining of the polyacrylamide gel 
detects the major conditioned media proteins produced 
by the drug resistant 293 cultures. In the pCEP4-Fl.Fc 
and the pCEP4-CT.Fc conditioned media, unique bands of 
the predicted sizes were abundantly secreted (see 
Figures 13B and 13C) . The full-length Fc fusion 
protein accumulated to a high concentration, indicating 
that it may be stable. Both Fc fusion proteins were 
detected by anti-human IgGl Fc antibodies (Pierce) on 
western blots, indicating that they are recombinant OPG 
products . 

The full length OPG-Fc fusion protein was 
purified by Protein-A column chromatography (Pierce) 
using the manufacturers recommended procedures. The 
protein was then subjected to N- terminal sequence 
analysis by automated Edman degradation as essentially 
described by Matsudaira et al . (J. Biol. Chem. 262 . 10- 
35 (1987)), The following amino acid sequence was read 
after 19 cycles: 
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NH2-E TLPPKYLHYDPETGHQL L-C02H 
(SEQ ID NO: 31) 

5 This sequence was identical to the predicted 

mouse OPG amino acid sequence beginning at amino acid 
residue 22, suggesting that the natural mammalian 
leader cleavage site is between amino acid residues 
Q21-E22, not between Y31-D32 as originally predicted. 
10 The expression experiments performed in 293-EBNA cells 
with pCEP4-Fl.Fc and pCEP4-CT.Fc demonstrate that OPG 
is a secreted protein, and may act systemically to bind 
its ligand. 

Procedures similar to those used to construct 
15 and express the muOPG [22-180 ] -Fc and muOPG[22-401] -Fc 
fusions were employed for additional mouse and human 
OPG-Fc fusion proteins . 

Murine OPG cDNA encoding amino acids 1-185 
fused to the Fc region of human IgGl [muOPG Ct(185).Fc] 
20 was constructed as follows. Murine OPG cDNA from 

plasmid pRcCMV Mu Osteoprotegerin (described in Example 
5) was amplified using the following primer pair in a 
polymerase chain reaction as described above: 

25 1333-82: 

5'-TCC CTT GCC CTG ACC ACT CTT-3 ' (SEQ ID NO: 32) 
1333-80: 

5'-CCT CTG CGG CCG CAC ACA CGT TGT CAT GTG TTG C- 
3' (SEQ ID NO: 33) 

30 

This primer pair amplifies the murine OPG 
cDNA region encoding amino acid residues 63-185 
(corresponding to bp 278-645) of the OPG reading frame 
as shown in Figure 9A. The 3' primer contains a Not I 
3 5 restriction site which is compatible with the in-frame 
Not I site of the Fc 'fusion vector pFcA3 . The product 
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also spans a unique EcoRI restriction site located at 
bp 436. The amplified PCR product was purified, 
cleaved with NotI and EcoRI, and the resulting EcoRI- 
Notl restriction fragment was purified. The vector 
pCEP4 having the murine 1-401 OPG-Fc fusion insert was 
cleaved with EcoRI and NotI, purified, and ligated to 
the PCR product generated above. The resulting pCEP4- 
based expression vector encodes OPG residues 1-185 
directly followed by all 227 amino acid residues of the 
human IgGl Fc region. The murine OPG 1-185. Fc fusion 
vector was transfected into 293 cells, drug selected, 
and conditioned media was produced as described above. 
The resulting secreted murine OPG 1-185. Fc fusion 
product was purified by Protein-A column chromatography 
(Pierce) using the manufacturers recommended 
procedures . 

Murine OPG DNA encoding amino acid residues 
1-194 fused to the Fc region of human IgGl (muOPG 
Ct(194).Fc) was constructed as follows. Mouse OPG cDNA 
from plasmid pRcCMV Mu-Osteoprotegerin was amplified 
using the following primer pairs: 

1333-82: 

5'-TCC CTT GCC CTG AGO ACT CTT-3 ' (SEQ ID NO: 34) 
1333-81: 

5'-CCT CTG CGG CCG CCT TTT GCG TGG CTT CTC TGT T- 
3' (SEQ ID NO:35} 

This primer pair amplifies the murine OPG 
cDNA region encoding amino acid residues 70-194 
(corresponding to bp 298-672) of the OPG reading frame. 
The 3 ' primer contains a Not I restriction site which 
is compatible with the in-frame Not I site of the Fc 
fusion vector pFcA3 . The product also spans a unique 
EcoRI restriction site located at bp 43 6. The 
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amplified PCR product was cloned into the murine 0PG[1- 
401] Fc fusion vector as described above. The 
resulting pCEP4-based expression vector encodes OPG 
residues 1-194 directly followed by all 227 amino acid 
5 residues of the human IgGl Fc region. The murine OPG 
1-194. Fc fusion vector was transfected into 293 cells, 
drug selected, and conditioned media was produced. The 
resulting secreted fusion product was purified by 
Protein-A column chromatography (Pierce) using the 
10 manufacturers recommended procedures. 

Human OPG DNA encoding amino acids 1-401 
fused to the Fc region of human IgGl was constructed as 
follows . Hinnan OPG DNA in plasmid pRcCMV-hu 
15 osteoprotegerin {described in Example 5) was amplified 
using the following oligonucleotide primers: 

1254-90: 

5'CCT CTG AGC TCA AGC TTG GTT TCC GGG GAC CAC AAT G-3 ' 
20 (SEQ ID NO:36) 

1254-95: 

5'-CCT CTG CGG CCG CTA AGC AGC TTA TTT TTA CTG AAT GG-3 ' 
(SEQ ID NO: 37) 

25 

The resulting PCR product encodes the full- 
length hximan OPG protein and creates a Not I 
restriction site which is compatible with the in-frame 
Not I site Fc fusion vector FcA3 . The PCR product was 

30 directionally cloned into the plasmid vector pCEP4 as 
described above. The resulting expression vector 
encodes human OPG residues 1-401 directly followed by 
227 amino acid residues of the human IgGl Fc region. 
Conditioned media from transfected and drug selected 

3 5 cells was produced and the huOPG Fl.Fc fusion product 
was purified by Protein-A column chromatography 
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(Pierce) using the manufacturers recommended 
procedures . 

Hximan OPG DNA encoding amino acid residues 1- 
2 01 fused to the Fc region of human IgGl [huOPG 
Ct{201).Fc] was constructed as follows. The cloned 
human OPG cDNA from plasmid pRrCMV-hu osteoprotegerin 
was amplified by PGR using the following 
oligonucleotide primer pair: 

1254-90: 

5' -COT CTG AGO TCA AGO TTG GTT TCC GGG GAG CAC AAT 
G-3 ' (SEQ ID NO: 38) 
1254-92: 

5'-CCT CTG CGG CCG CCA GGG TAA CAT CTA TTC CAC-3 ' 
(SEQ ID NO: 39) 

This primer pair amplifies the human OPG cDNA 
region encoding amino acid residues 1-2 01 of the OPG 
20 reading frame, and creates a Not I restriction site at 
the 3 ' end which is compa table with the in- frame Not I 
site Fc fusion vector FcA3 . This product, when linked 
to the Fc portion, encodes OPG residues 1-201 directly 
followed by all 221 amino acid residues of the human 
25 IgGl Fc region. The PGR product was directionally 
cloned into the plasmid vector pCEP4 as described 
above. Conditioned media from transfected and drug 
selected cells was produced, and the hu OPG Ct(201) .Fc 
fusion products purified by Protein-A column 
3 0 chromatography (Pierce) using the manufacturer's 
recommended procedures . 

The following procedures were used to 
construct and express unfused mouse and hiiman OPG. 
3 5 A plasmid for mammalian expression of full- 

length murine OPG (residues 1-401) was generated by PGR 
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amplification of the murine OPG cDNA insert from pRcCMV 
Mu-Osteoprotegerin and subcloned into the expression 
vector pDSRa (DeClerck et. atl, J. Biol. Chem, 266 . 
3 893 (1991) ) , The following oligonucleotide primers 
5 were used: 

1295-26: 

5'-CCG AAG CTT CCA CCA TGA ACA AGT GGC TGT GCT 
GC-3' (SEQ ID NO: 40) 

10 

1295-27: 

5'-CCT CTG TCG ACT ATT ATA AGC AGC TTA TTT TCA CGG 
ATT G-3' {SEQ ID NO: 41) 

15 The murine OPG full length reading frame was 

amplified by PGR as described above • The PGR product 
was purified and digested with restriction 
endonucleases Hind III and Xba I (Boehringer Mannheim, 
Indianapolis, IN) under the manufacturers recommended 

20 conditions, then ligated to Hind III and Xba I digested 
pDSRa. Recombinant clones were detected by restriction 
endonuclease digestion, then sequenced to ensure no 
mutations were produced during the PGR amplification 
steps . 

25 The resulting plasmid, pDSRa-muOPG was 

introduced into Chinese hamster ovary (CHO) cells by 
calcium mediated transfection (Wigler et al. Cell 11, 
233 (1977)). Individual colonies were selected based 
upon expression of the dihydrof olate reductase (DHFR) 

30 gene in the plasmid vector and several clones were 
isolated. Expression of the murine OPG recombinant 
protein was monitored by western blot analysis of CHO 
cell conditioned media. High expressing cells were 
selected, and OPG expression was further amplified by 

3 5 treatment with methotrexate as described (DeClerck et 
al . , idid) . Conditioned media from CHO cell lines was 
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produced for further purification of recombinant 
secreted murine OPG protein. 

A plasmid for mammalian expression of full- 
5 length human OPG (amino acids 1-401) was generated by 
subcloning the cDNA insert in pRcCMV-hu Osteoprotegerin 
directly into vector pDSRa (DeClerck et al , , ibid). The 
pRcCMV-OPG plasmid was digested to completion with Not 
I, blunt ended with Klenow, then digested to completion 

10 with Xba I. Vector DNA was digested with Hind III, 

blunt ended with Klenow, then digested with Xba I, then 
ligated to the OPG insert. Recombinant plasmids were 
then sequenced to confirm proper orientation of the 
human OPG cDNA. 

15 The resulting plasmid pDSRa-huOPG was 

introduced into Chinese hamster ovary (CHO) cells as 
described above. Individual colonies were selected 
based upon expression of the dihydrof olate reductase 
(DHFR) gene in the plasmid vector and several clones 

20 were isolated. Expression of the human OPG recombinant 
protein was monitored by western blot analysis of CHO 
cell conditioned media. High expressing clones were 
selected/ and OPG expression was further amplified by 
treatment with methotrexate. Conditioned media from 

25 CHO cell lines expressing human OPG was produced for 
protein purification. 

Expression vectors for murine OPG encoding 
residues 1-185 were constructed as follows. Murine OPG 
3 0 cDNA from pRcCMV-Mu OPG was amplified using the 
following oligonucleotide primers: 

1333-82 : 

5'-TCC CTT GCC CTG ACC ACT CTT-3 ' (SEQ ID NO: 42) 
35 1356-12: 
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5'-CCT CTG TCG ACT TAA CAC ACG TTG TCA TGT GTT 
GC-3' (SEQ ID NO: 43) 

This primer pair amplifies the murine OPG 
5 cDNA region encoding amino acids 63-185 of the OPG 

reading frame (bp 278-645) and contains an artificial 
stop codon directly after the cysteine codon (C185), 
which is followed by an artificial Sal I restriction 
endonuclease site. The predicted product contains an 

10 internal Eco RI restriction site useful for subcloning 
into a pre-existing vector. After PGR amplification, 
the resulting purified product was cleaved with Eco RI 
and Sal I restriction endonuc leases, and the large 
fragment was gel purified. The purified product was 

15 then subcloned into the large restriction fragment of 
an Eco RI and Sal I digest of pBluescript-muOPG Fl.Fc 
described above. The resulting plasmid was digested 
with Hind III and Xho I and the small fragment was gel 
purified. This fragment, which contains a open reading 

2 0 frame encoding residues 1-185 was then subcloned into a 
Hind III and Xho I digest of the expression vector 
pCEP4. The resulting vector, pmuOPG [1-185], encodes a 
truncated OPG polypeptide which terminates at a 
cysteine residue located at position 185. Conditioned 

25 media from transfected and drug selected cells was 
produced as described above. 



1333-82: 

30 5'-TCC CTT GCC CTG ACC ACT CTT-3 ' (SEQ ID NO: 44) 

1356-13: 

5'-CCT CTG TCG ACT TAC TTT TGC GTG OCT TCT CTG 
TT-3' (SEQ ID NO: 45) 



35 
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This primer pair amplifies the murine OPG 
cDNA region encoding amino acids 70-194 of the OPG 
reading frame (bp 298-672) and contains an artificial 
stop codon directly after the lysine codon {K194) , 
which is followed by an artificial Sal I restriction 
endonuclease site. The predicted product contains an 
internal Eco RI restriction site useful for subcloning 
into a pre-existing vector. After PGR amplification; 
the resulting purified product was cleaved with Eco RI 
and Sal I restriction endonuc leases , and the large 
fragment was gel purified. The purified product was 
then subcloned into the large restriction fragment of 
an Eco RI and Sal I digest of pBluescript-muOPG Fl.Fc 
described above. The resulting plasmid was digested 
with Hind III and Xho I and the small fragment was gel 
purified. This fragment, which contains a open reading 
frame encoding residues 1-185 was then subcloned into a 
Hind III and Xho I digest of the expression vector 
pCEP4. The resulting vector, pmuOPG [1-185], encodes a 
truncated OPG polypeptide which terminates at a lysine 
at position 194. Conditioned media from transfected 
and drug selected cells was produced as described 
above . 

Several mutations were generated at the 5 ' 
end of the huOPG [22-401] -Fc gene that introduce either 
amino acid substitutions, or deletions, of OPG between 
residues 22 through 32. All mutations were generated 
with the ^^QuiclcChange'™ 'Site-Directed Mutagenesis Kit" 
(Stratagene, San Diego, CA) using the manf acturer ' s 
recommended conditions. Briefly, reaction mix 
containing huOPG [22-401] -Fc plasmid DNA template and 
mutagenic primers were treated with Pfu polymerase in 
the presence of deoxynucleotides, then amplified in a 
thermocycler as described above. An aliqout of the 
reaction is then transfected into competent E. coli 
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XLl-Blue by heatshock, then plated. Plasmid DNA from 
trans formants was then sequenced to verify mutations. 

The following primer pairs were used to 
delete residues 22-26 of the hximan OPG gene, resulting 
5 in the production of a huOPG [27-401] -Fc fusion 
protein: 

1436-11: 

5'-TGG ACC ACC CAG AAG TAG CTT CAT TAT GAC-3 ' 
10 (SEQ ID NO: 140) 

1436-12: 

5'-GTC ATA ATG AAG GTA CTT CTG GGT GGT CCA-3 ' 
(SEQ ID NO: 141) 



15 



20 



The following primer pairs were used to 
delete residues 22-28 of the human OPG gene, resulting 
in the production of a huOPG [29-401] -Fc fusion 
protein: 

1436-17: 

5'-GGA CCA CCC AGC TTC ATT ATG ACG AAG AAA C-3 ' 
(SEQ ID NO: 142) 



25 1436-18: 

5'-GTT TCT TCG TCA TAA TGA AGC TGG GTG GTC C-3' 
(SEQ ID NO: 143) 

The following primer pairs were used to 
30 delete residues 22-31 of the human OPG gene, resulting 
in the production of a huOPG [32-401] -Fc fusion 
protein: 
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1436-27: 

5'-GTG GAG CAC CCA GGA CGA AGA AAC CTC TC-3 ' 
(SEQ ID NO: 144) 

5 1436-28: 

5 '-GAG AGG TTT CTT CGT CCT GGG TGG TCC AC-3 ' 
(SEQ ID NO: 145) 

The following primer pairs were used to 
10 change the codon for tyrosine residue 28 to 

phenylalanine of the human OPG gene, resulting in the 
production of a huOPG [22-401] -Fc Y28F fusion protein 

1436-29: 

15 5' -CGT TTC CTC CAA AGT TCC TTC ATT ATG AC-3 ' 

(SEQ ID NO: 146) 

1436-30: 

5 ' -GTC ATA ATG AAG GAA CTT TGG AGG AAA CG-3 ' 
20 (SEQ ID NO:147) 

The following primer pairs were used to 
change the codon for proline residue 2 6 to alanine of 
the human OPG gene, resulting in the production of a 
25 huOPG [22-401] -Fc P26A fusion protein: 

1429-83 : 

5' -GGA AAC GTT TCC TGC AAA GTA CCT TCA TTA TG-3 
(SEQ ID NO: 148) 

30 

1429-84: 

5 '-CAT AAT GAA GGT ACT TTG CAG GAA ACG TTT CC-3 ' 
(SEQ ID NO: 149) 



35 Each resulting muOPG [22-401] -Fc plasmid 

containing the appropriate mutation was then 
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transfected into human 293 cells, the mutant OPG-Fc 
fusion protein purified from conditioned media as 
described above. The biological activity of each 
protein was assessed the in vitro osteoclast forming 
5 assay described in Example 11, 



EXAMPLE 8 
Expression of OPG in E. coli 

A, Bacterial Expression Vectors 

PAMG21 

The expression plasmid pAMG21 can be derived 
from the Amgen expression vector pCFM1656 (ATCC #6957 6) 
which in turn be derived from the Amgen expression 
vector system described in US Patent No. 4,710,473. 
The PCFM165 6 plasmid can be derived from the described 
PCFM83 6 plasmid (Patent No. 4,710,473) by: (a) 
destroying the two endogenous Ndel restriction sites 
by end filling with T4 polymerase enzyme followed by 
blunt end ligation; (b) replacing the DNA sequence 
between the unique Aatll and Clal restriction sites 
containing the synthetic Pl promoter with a similar 
fragment obtained from pCFM636 (patent No. 4,710,473) 
containing the PL promoter 



Aatll 

3 0 5 ' CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT- 
3 ' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTTAAGTATA- 



-AAAAAACATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAA- 
-TTTTTTGTATGTCTATTGGTAGACGCCACTATTTAATAGAGACCGCCACAACTGTATTT- 

-TACCACTGGCGGTGATACTGAGCACAT 3' ( SEQ ID NO: 53) 
-ATGGTGACCGCCACTATGACTCGTGTAGC5 ' (SEQ ID NO: 54) 

Clal 
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and then (c) substituting the small DNA sequence 
between the unique Clal and Kpnl restriction sites 
with the following oligonucleotide: 

5 

5 ' CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC3 ' 

(SEQ ID NO: 48) 

3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5 ' 

10 (SEQ ID NO: 49) 

Clal Kpnl 

The expression plasmid pAMG21 can then be derived from 
pCFMl656 by making a series of site directed base 
changes by PGR overlapping oligo mutagenesis and DNA 
sequence substitutions* Starting with the Bglll site 
(plasmid bp # 180) immediately 5' to the plasmid 
replication promoter PcopB and proceeding toward the 
plasmid replication genes, the base pair changes are as 
follows : 



15 



20 



PAMG21 bp # bp in DCFM1656 bp changed to in PAMG21 



40 



# 


204 


T/A 


C/G 


# 


428 


A/T 


G/C 


# 


509 


G/C 


A/T 


# 


617 




insert two G/C 


# 


579 


G/C 


T/A 


# 


980 


T/A 


C/G 


# 


994 


G/C 


A/T 


# 


1004 


A/T 


C/G 


# 


1007 


C/G 


T/A 


# 


1028 


A/T 


T/A 


# 


1047 


C/G 


T/A 


# 


1178 


G/C 


T/A 


# 


1466 


G/C 


T/A 


# 


2028 


G/C 


bp deletion 


# 


2187 


C/G 


T/A 


# 


2480 


A/T 


T/A 


# 


2499-2502 


AGTG 


GTCA 






TCAC 


CAGT 


# 


2642 


TCCGAGC 


7 bp deletion 






AGGCTCG 





# 3435 

# 3446 

# 3643 



G/C 
G/C 
A/T 



A/T 
A/T 
T/A 
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The DNA sequence between the unique Aatll (position 
#4364 in pCFM1656) and SacII (position #4585 in 
5 pCFM1656) restriction sites is substituted with the 
following DNA sequence: 



[Aatll sticky end] 5' GCGTAACGTATGCATGGTCTCC- 

10 (position #4358 in pAMG21) 3' TGCACGCATTGCATACGTACCAGAGG- 

-CCATGCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACT- 
-GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA- 

15 -GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC- 
-CCCGGAAAGCAAAATAGACAACAAACAGCCACTTGCGAGAGGACTCATCCTGTTTAGGCG- 

-CGGGAGCGGATTTGAACGTTGCGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGC- 
-GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCCACCGCCCGTCCTGCGGGCG- 

20 

-CATAAACTGCCAGGCATCAAATTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGT- 
-GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA- 

Aatll 

2 5 -TTCTACAAACTCTTTTGTTTATTTTTCTAAATACATTCAAATATGGACGTCGTACTTAAC- 

-AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG- 

-TTTTAAAGTATGGGCAATCAATTGCTCCTGTTAAAATTGCTTTAGAAATACTTTGGCAGC- 
-AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG- 

30 

-GGTTTGTTGTATTGAGTTTCATTTGCGCATTGGTTAAATGGAAAGTGACCGTGCGCTTAC- 
-CCAAACAACATAACTCAAAGTAAACGCGTAACCAATTTACCTTTCACTGGCACGCGAATG- 

-TACAGCCTAATATTTTTGAAATATCCCAAGAGCTTTTTCCTTCGCATGCCCACGCTAAAC- 

3 5 -ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG- 

-ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTATTTGCTATATTTATTTTTC- 
-TAAGAAAAAGAGAAAACCAATTTAGCAACAAACTAAATAATAAACGATATAAATAAAAAG- 

4 0 -GATAATTATCAACTAGAGAAGGAACAATTAATGGTATGTTCATACACGCATGTAAAAATA- 

-CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT- 

-AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT- 
-TTGATAGATATATCAACAGAAAGAGACTTACACGTTTTGATTCGTAAGGCTTCGGTAATA- 

45 

-TAGCAGTATGAATAGGGAAACTAAACCCAGTGATAAGACCTGATGATTTCGCTTCTTTAA- 
-ATCGTCATACTTATCCCTTTGATTTGGGTCACTATTCTGGACTACTAAAGCGAAGAAATT- 

-TTACATTTGGAGATTTTTTATTTACAGCATTGTTTTCAAATATATTCCAATTAATCGGTG- 

5 0 - AATGTAAACCTCTAAAAAATAAATGTCGTAAC AAAAGTTTATATAAGGTTAATTAGCCAC - 

-AATGATTGGAGTTAGAATAATCTACTATAGGATCATATTTTATTAAATTAGCGTCATCAT- 
-TTACTAACCTCAATCTTATTAGATGATATCCTAGTATAAAATAATTTAATCGCAGTAGTA- 

5 5 -AATATTGCCTCCATTTTTTAGGGTAATTATCCAGAATTGAAATATCAGATTTAACCATAG- 
-TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC- 

-AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATGTACCATTTTAGTCATATCAG- 
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-TTACTCCTATTTACTAGCGCTCATTTATTATAAGTGTTACATGGTAAAATCAGTATAGTC- 

-ATAAGCATTGATTAATATCATTATTGCTTCTACAGGCTTTAATTTTATTAATTATTCTGT- 
-TATTCGTAACTAATTATAGTAATAACGAAGATGTCCGAAATTAAAATAATTAATAAGACA- 

5 

-AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC- 
-TTCACAGCAGCCGTAAATACAGAAAGTATGGGTAGAGAAATAGGAATGGATAACAAACAG- 

-GCAAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGACATTTGATTCTAATAA- 
1 0 -CGTTCAAAACGCACAATATATAGTAATTTTGCCATTATCTAACTGTAAACTAAGATTATT- 

-ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG- 
-TAACCTAAAAACAGTGTGATAATATAGCGAACTTTATGTTAACAAATTGTATTCATGGAC- 

1 5 -TAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTATAGTCGATTAATCGATTTGATT- 
-ATCCTAGCATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCTAAACTAA- 

-CTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGA- 
-GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT- 

20 

Sacll 

-GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA- 
-CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT- 

2 5 -GAAGAAGAAGAAGAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATA- 
-CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT- 

-ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG- 
-TGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCC- 

30 

-AACCGCTCTTCACGCTCTTCACGC 3' [SacII Sticky end] (SEQ ID NO: 50) 
-TTGGCGAGAAGTGCGAGAAGTG 5' (position #5904 in pAMG21) {SEQ ID 
NO: 46) 



3 5 During the ligation of the sticky ends of this 

substitution DNA sequence, the outside Aatll and SacII 
sites are destroyed. There are unique Aatll and SacII 
sites in the substituted DNA. 



40 PAMG22-His 

The expression plasmid pAMG22-His can be derived from 
the Amgen expression vector pAMG22 by substituting the 
small DNA sequence between the unique Ndel { #4795) and 
EcoRI ( #4818) restriction sites of pAMG22 with the 

45 following oligonucleotide duplex: 



Ndel Nhel EcoRI 

5 ' TATGAAACATCATCACCATCACCATCATGCTAGCGTTAACGCGTTGG 3 ' 

(SEQ ID NO: 51) 

50 3' ACTTTGTAGTAGTGGTAGTGGTAGTACGATCGCAATTGCGCAACCTTAA 5' 



(SEQ ID NO: 52) 
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MetLysHisHisHisHisHisHisHisAlaSerValAsnAlaLeuGlu 
(SEQ ID NO: 168) 

PAMG22 

5 The expression plasmid pAMG22 can be derived from the 
Amgen expression vector pCFM1656 (ATCC #69576) which in 
turn be derived from the Amgen expression vector system 
described in US Patent No. 4,710,473 granted December 
1, 1987. The pCFMl656 plasmid can be derived from the 

10 described pCFM83 6 plasmid (Patent No, 4,710,473) by: 
(a) destroying the two endogenous Ndel restriction 
sites by end filling with T4 polymerase enzyme followed 
by blunt end ligation; (b) replacing the DNA sequence 
between the unique Aatll and Clal restriction sites 

15 containing the synthetic PL promoter with a similar 
fragment obtained from pCFM636 (patent No, 4,710,473) 
containing the PL promoter 

Aatll 

2 0 5 ' CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT - 

3 ' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGOGACGTTTTTTATTTAAGTATA- 

-AAAAAACATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAA- 
-TTTTTTGTATGTCTATTGGTAGACGCCACTATTTAATAGAGACCGCCACAACTGTATTT- 

25 

-TACCACTGGCGGTGATACTGAGCACAT 3' (SEQ ID NO: 53) 
-ATGGTGACCGCCACTATGACTCGTGTAGCS ' ( SEQ ID NO : 54 ) 

ClaX 

3 0 and then (c) substituting the small DNA sequence 

between the unique ClaJ and Kpnl restriction sites 
with the following oligonucleotide: 

5 ' CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC 3 ' 
35 (SEQ ID NO:55) 
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3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 
(SEQ ID NO:56) 

Clal Kpnl 



10 



The expression plasmid pAMG22 can then be derived from 
pCFMl656 by making a series of site directed base 
changes by PCR overlapping oligo mutagenesis and DNA 
sequence substitutions. Starting with the Bglll site 
(plasmid bp # 180) immediately 5' to the plasmid 
replication promoter PcopB and proceeding toward the 
plasmid replication genes, the base pair changes are as 
follows ; 



15 



PAMG22 bp # bp in PCFM1656 



bp changed to in 
PAMG22 



20 



25 



30 



35 



40 



# 204 

# 428 

# 509 

# 617 



T/A 
A/T 
G/C 



C/G 
G/C 
A/T 

insert two G/C 
bp 



# 679 


G/C 


T/A 


# 980 


T/A 


C/G 


# 994 


G/C 


A/T 


# 1004 


A/T 


C/G 


# 1007 


C/G 


T/A 


# 1028 


A/T 


T/A 


# 1047 


C/G 


T/A 


# 1178 


G/C 


T/A 


# 1466 


G/C 


T/A 


# 2028 


G/C 


bp deletion 


# 2187 


C/G 


T/A 


# 2480 


A/T 


T/A 


# 2499-2502 


AGTG 


GTCA 




TCAC 


CAGT 


# 2642 


TCCGAGC 


7 bp deletion 




AGGCTCG 




# 3435 


G/C 


A/T 


# 3446 


G/C 


A/T 


# 3643 


A/T 


T/A 



45 
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The DNA sequence between the unique Aatll (position 
#4364 in pCFM1656) and SacII (position #4585 in 
PCFM1656) restriction sites is substituted with the 
following DNA sequence: 

5 

[Aatll sticky end] (position #43 58 in pAMG22) 



5 • GCGTAACGTATGCATGGTCTCCCCATGCGAGAGTAGGGAACTGCCAGGCATCAA- 
3 ' TGCACGCATTGCATACGTACCAGAGGGGTACGCTCTCATCCCTTGACGGTCCGTAGTT- 

10 

-ATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTG- 
-TATTTTGCTTTCCGAGTCAGCTTTCTGACCCGGAAAGCAAAATAGACAACAAACAGCCAC- 



1 5 -AACGCTCTCCTGAGTAGGACAAATCCGCCGGGAGCGGATTTGAACGTTGCGAAGCAACGG- 
-TTGCGAGAGGACTCATCCTGTTTAGGCGGCCCTCGCCTAAACTTGCAACGCTTCGTTGCC- 

-CCCGGAGGGTGGCGGGCAGGACGCCCGCCATAAACTGCCAGGCATCAAATTAAGCAGAAG- 
-GGGCCTCCCACCGCCCGTCCTGCGGGCGGTATTTGACGGTCCGTAGTTTAATTCGTCTTC- 

20 

-GCCATCCTGACGGATGGCCTTTTTGCGTTTCTACAAACTCTTTTGTTTATTTTTCTAAAT- 
-CGGTAGGACTGCCTACCGGAAAAACGCAAAGATGTTTGAGAAAACAAATAAAAAGATTTA- 

Aatll 

2 5 - ACATTCAAATATGGACGTCTCATAATTTTTAAAAAATTCATTTGACAAATGCTAAAATTC - 

-TGTAAGTTTATACCTGCAGAGTATTAAAAATTTTTTAAGTAAACTGTTTACGATTTTAAG- 

-TTGATTAATATTCTCAATTGTGAGCGCTCACAATTTATCGATTTGATTCTAGATTTGTTT- 
-AACTAATTATAAGAGTTAACACTCGCGAGTGTTAAATAGCTAAACTAAGATCTAAACTCA- 

30 

-TAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGAGCTCACTAGTGT- 
-ATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCTCGAGTGATCACA- 

SacII 

3 5 -CGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAAGAAGAAGAAGAA- 

-GCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTTCTTCTTCTTCTT- 

-GAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATAACTAGCATAACC- 
-CTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTATTGATCGTATTGG- 

40 

-CCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACCGCTCTTCA- 
-GGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCCTTGGCGAGAAGT- 

-CGCTCTTCACGC 3' (SEQ ID NO: 58) 
45 -GCGAGAAGTG 5* (SEQ ID NO: 57) 

[SacIX sticky end] (position #5024 in pAMG22) 



During the ligation of the sticky ends of this 
50 substitution DNA sequence, the outside Aatll and SacII 
sites are destroyed. There are unique Aatll and SacII 
sites in the substituted DNA. 
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B. Human OPG Met [32-4011 

In the example, the expression vector used 
was pAMG21, a derivative of pCFM1656 (ATCC accession 
no. 69576) which contains appropriate restriction sites 
5 for insertion of genes downstream from the lux PR 
promoter. {See U.S. Patent No. 5,169,318 for 
description of the lux expression system) . The host 
cell used was GM120 (ATCC accession no. 55764) . This 
host has the lacIQ promoter and lad gene integrated 

10 into a second site in the host chromosome of a 

prototrophic coli K12 host. Other commonly used E. 
coli expression vectors and host cells are also 
suitable for expression. 

A DNA sequence coding for an N-terminal 

15 methionine and amino acids 32-401 of the human OPG 
polypeptide was placed under control of the luxPR 
promoter in the plasmid expression vector pAMG21 as 
follows. To accomplish this, PGR using 
oligonucleotides #1257-20 and #1257-19 as primers was 

20 performed using as a template plasmid pRcCMV-Hu OPG DNA 
containing the human OPG cDNA and thermocycling for 3 0 
cycles with each cycle being: 94°C for 20 seconds, 
followed by 37^C for 30 seconds, followed by 12^C for 
3 0 seconds. The resulting PCR sample was resolved on 

25 an agarose gel^ the PCR product was excised, purified, 
and restricted with Kpnl and BamHI restriction 
endonucleases and purified. Synthetic oligonucleotides 
#1257-21 and #1257-22 were phophorylated individually 
using T4 polynucleotide kinase and ATP, and were then 

30 mixed together, heated at 94°C and allowed to slow cool 
to room temperature to form an oligonucleotide linker 
duplex containing Ndel and Kpnl sticky ends. The 
phosphorylated linker duplex formed between 
oligonucleotides #1257-21 and #1257-22 containing Ndel 

3 5 and Kpnl cohesive ends (see Figure 14A) and the Kpnl 
and BamHI digested and purified PCR product generated 
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using oligo primers #1257-20 and #1257-19 (see above) 
was directionally inserted between two sites of the 
plasmid vector pAMG21, namely the Ndel site and BamHI 
site, using standard recombinant DNA methodology (see 
5 Figure 14A and sequences below) . The synthetic linker 
utilized E. coli codons and provided for a N- terminal 
methionine . 

Two clones were selected and plasmid DNA 
isolated, and the human OPG insert was subsequently DNA 
10 sequence confiirmed. The resulting pAMG21 plasmid 
containing amino acids 32-401 of the human OPG 
polypeptide immediately preceded in frame by a 
methionine is referred to as pAMG21-huOPG met [32-401] 
or pAMG21-huOPG met [32-401] . 

15 

Oligo#1257-19 

5 • -TACGCACTGGATCCTTATAAGCAGCTTATTTTTACTGATTGGAC-3 ' 
(SEQ ID NO: 59) 

20 Oligo#1257-20 

5 • -GTCCTCCTGGTACCTACCTAAAACAAC-3 ' ( SEQ ID NO : 6 0 ) 

Oligo#1257-21 

5 • -TATGGATGAAGAAACTTCTCATCAGCTGCTGTGTGATAAATGTCC 
25 GCCGGGTAC -3' (SEQ ID NO: 51) 

01igo#1257-22 

5 ' -CCGGCGGACATTTATCACACAGCAGCTGATGAGAAGTTTCTTCATCCA-3 ' 
(SEQ ID NO: 47) 

30 

Cultures of pAMG21-huOPG met [32-401] in E. 
coli GM12 0 in 2XYT media containing 20 M-g/ml Jcanamycin 
were incubated at 3 0^C prior to induction. Induction 
of huOPG met [32-401] gene product expression from the 
3 5 luxPR promoter was achieved following the addition of 
the synthetic autoinducer N- (3-oxohexanoyl) -DL- 
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homoserine lactone to the culture media to a final 
concentration of 3 0 ng/ml and cultures were incubated 
at either 3 0°C or 37^C for a further 6 hours. After 5 
hours, the bacterial cultures were examined by 
5 microscopy for the presence of inclusion bodies and 
were then pelletted by centrifugation, Refractile 
inclusion bodies were observed in induced cultures 
indicating that some of the recombinant huOPG met [32- 
401] gene product was produced insolubly in E. coli . 

10 Some bacterial pellets were resuspended in lOmM Tris- 
HCl/pH8, ImM EDTA and lysed directly by addition of 2X 
Laemlli sample buffer to IX final, and P- 
mercaptoethanol to 5% final concentration, and analyzed 
by SDS-PAGE. A substantially more intense coomassie 

15 stained band of approximately 42kDa was observed on a 
SDS-PAGE gel containing total cell lysates of 30°C and 
37 induced cultures versus lane 2 which is a total 
cell lysate of a 30°C uninduced culture (Figure 14B) . 
The expected gene product would be 370 amino acids in 

2 0 length and have an expected molecular weight of about 

42.2 kDa, Following induction at 37°C for 6 hours, an 
additional culture was pelleted and either processed 
for isolation of inclusion bodies (see below) or 
processed by microf luidizing. The pellet processed for 
25 microf luidizing was resuspended in 25mM Tris-HCl/pH8 , 
0.5M NaCl buffer and passed 20 times through a 
Microf luidizer Model 1108 (Microf luidics Corp,) and 
collected. An aliquot was removed of the collected 
sample (microf luidized total lysate) , and the remainder 

3 0 was pelleted at 20,000 x g for 20 minutes. The 

supernatant following centrifugation was removed 
(microf luidized soluble fraction) and the pellet 
resuspended in a 25mM Tris-HCl/pHB, 0.5M NaCl, 6M urea 
solution (microf luidized insoluble fraction) . To an 
3 5 aliquot of either the total soluble, or insoluble 

fraction was added to an equal volume of 2X Laemalli 



sample buffer and |3-mercaptoethanol to 5% final 
concentration. The samples were then analyzed by 
SDS-PAGE. A significant amount of recombinant huOPG 
met [32-401] gene product appeared to be found in the 
insoluble fraction. To purify the recombinant protein 
inclusion bodies were purified as follows: Bacterial 
cells were separated from media by density gradient 
centrifugation in a Beckman J-6B centrifuge equipped 
with a JS-4.2 rotor at 4,900 x g for 15 minutes at 4^0. 
The bacterial pellet was resuspended in 5 ml of water 
and then diluted to a final volume of 10 ml with water. 
This suspension was transferred to a stainless steel 
cup cooled in ice and subjected to sonic disruption 
using a Branson Sonifier equipped with a standard tip 
(power setting=5, duty cycle=95%, 80 bursts). The 
sonicated cell suspension was centrifuged in a Beckman 
Optima TLX ultracentrif uge equipped with a TLA 100,3 
rotor at 195,000 x g for 5 to 10 minutes at 23^C. The 
supernatant was discarded and the pellet rinsed with a 
stream of water from a squirt bottle. The pellets were 
collected by scraping with a micro spatula and 
transferred to a glass homogenizer (15 ml capacity) . 
Five ml of Percoll solution (75% liquid Percoll, 0.15 M 
sodium chloride) was added to the homogenizer and the 
contents are homogenized until uniformly suspended. 
The volume was increased to 19.5 ml by the addition of 
Percoll solution, mixed, and distributed into 3 Beckman 
Quick-Seal tubes (13 x 32 mm). Tubes were sealed 
according to manufacturers instructions. The tubes 
were spun in a Beckman TLA 100.3 rotor at 23 ^C, 20,000 
rpm (21,600 x g), 30 minutes. The tubes were examined 
for the appropriate banding pattern. To recover the 
refractile bodies, gradient fractions were recovered 
and pooled, then diluted with water. The inclusion . 
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bodies were pelleted by centrifugation, and the protein 
concentration estimated following SDS-PAGE. 

An aliquot of inclusion bodies isolated as 
described below was dissolved into IX Laemlli sample 
5 buffer with 5% p-mercaptoethanol and resolved on a SDS- 
PAGE gel and the isolated inclusion bodies provide a 
highly purified recombinant huOPG[32-401] gene product. 
The major --42 kDa band observed after resolving 
inclusion bodies on a SDS-polyacrylamide gel was 
10 excised from a separate gel and the N-terminal amino 
acid sequence determined essentially as described 
(Matsudaira et al . J. Biol. Chem. 262 . 10-35 (1987)). 
The following sequence was determined after 19 cycles: 

15 NH2 -MDEETSHQLLCDKCPPGTY-COOH (SEQ ID NO: 62) 

This sequence was found to be identical to the first 19 
amino acids encoded by the pAMG21 Hu-OPG met [32-401] 
expression vector, produced by a methionine residue 
20 provided by the bacterial expression vector. 

C. Human OPG met [22-401] 

A DNA sequence coding for an N- terminal 
methionine and amino acids 22 through 401 of human OPG 

25 was placed under control of the luxPR promoter in a 
prokaryotic plasmid expression vector pAMG21 as 
follows. Isolated plasmid DNA of pAMG21-huOPG met [32- 
401] (see Section B) was cleaved with Kpnl and BamHI 
restriction endonucleases and the resulting fragments 

30 were resolved on an agarose gel. The B fragment (-1064 
bp fragment) was isolated from the gel using standard 
methodology. Synthetic oligonucleotides (oligos) 
#1267-06 and #1267-07 were phosphorylated individually 
and allowed to form an oligo linker duplex, which 

3 5 contained Ndel and Kpnl cohesive ends, using methods 
described in Section B. The synthetic linker duplex 
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utilized coli co'dons and provided for an N-terminal 
methionine. The phosphorylated oligo linker containing 
Ndel and Kpnl cohesive ends and the isolated -1064 bp 
fragment of pAMG21-huOP met [32-401] digested with Kpnl 
5 and BamHI restriction endonucleases were directionally 
inserted between the Ndel and BamHI sites of pAMG21 
using standard recombinant DMA methodology. The 
ligation mixture was transformed into E^. coli host 3 93 
by electroporation utilizing the manufacturer's 
10 protocol. Clones were selected, plasmid DNA was 

isolated, and DNA sequencing was performed to verify 
the DNA sequence of the huOPG-met [22-401] gene. 



Oligo #1267-06 

15 5' -TAT GGA AAC TTT TCC TCC AAA ATA TCT TCA TTA TGA TGA 
AGA AAC TTC TCA TCA GCT GCT GTG TGA TAA ATG TCC GCC GGG 
TAC-3 • (SEQ ID NO: 63) 

Oligo #1257-07 

2 0 5 ' -CCG GCG GAC ATT TAT CAC ACA GCA GCT GAT GAG AAG TTT 

CTT CAT CAT AAT GAA GAT ATT TTG GAG GAA AAG TTT CCA- 3 ' 
(SEQ ID NO: 64) 

Cultures of pAMG21-huOPG-met [22-401] in E^ 
25 coli host 393 were placed in 2XYT media containing 20 
\ig/ml kanamycin and were incubated at 3 0°C prior to 
induction. Induction of recombinant gene product 
expression from the luxPR promoter of vector pAMG21 was 
achieved following the addition of the synthetic 

3 0 auto inducer N- (3-oxohexanoyl) -DL-homoserine lactone to 

the culture media to a final concentration of 3 0 ng/ml 
and incubation at either 30°C or 37^C for a further 6 
hours. After 6 hours, bacterial cultures were pelleted 
by centrifugation (=30^0 1+6 or 37^0 1+6) . Bacterial 
3 5 cultures were also either pelleted just prior to 

induction (=3 0°C Prel) or alternatively no autoinducer 
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was added to a separate culture which was allowed to 
incubate at 3 0^C for a further 6 hours to give an 
uninduced (UI) culture {=30^C UI) . Bacterial pellets 
of either 30*^C Prel, 30<^C UI, 30^C 1+6, or 37^0 1+6 
5 cultures were resuspended, lysed, and analyzed by SDS- 
polyacrylamide gel electrophoresis (PAGE) as described 
in Section B. Polyacrylamide gels were either stained 
with coomassie blue and/or Western transferred to 
nitrocellulose and iminunoprobed with rabbit anti-mu 
10 OPG-Fc polyclonal antibody as described in Example 10. 
The level of gene product following induction compared 
to either an uninduced (3 0^C UI) or pre-induction (30^C 
Prel) sample. 

15 D. Murine OPG met [22-4011 

A DMA sequence coding for an N- terminal 
methionine and amino acids 22 through 401 of the murine 
(mu) OPG (OPG) polypeptide was placed under control of 
the luxPR promoter in a prokaryotic plasmid expression 

20 vector pAMG21 as follows. PGR was performed using 
oligonucleotides #1257-16 and #1257-15 as primers, 
plasmid pRcCMV-Mu OPG DNA as a template and 
thermocycling conditions as described in Section B. 
The PGR product was purified and cleaved with Kpnl and 

25 BamHI restriction endonucleases as described in Section 
B. Synthetic oligos #1260-61 and #1260-82 were 
phosphorylated individually and allowed to form an 
oligo linker duplex with Ndel and Kpnl cohesive ends 
using methods described in Section B. The synthetic 

3 0 linker duplex utilized coli codons and provided for 
an N- terminal methionine. The phosphorylated linker 
duplex formed between oligos #12 60-51 and #1260-82 
containing Ndel and Kpnl cohesive ends and the Kpnl and 
BamHI digested and purified PGR product generated using 

35 oligo primers #1257-16 and #1257-15 were directionally 
inserted between the Ndel and BamHI sites of pAMG21 



- 83 - 

using standard methodology. The ligation mixture was 
transformed into coli host 3 93 by electroporation 
utilizing the manufacturer's protocol. Clones were 
selected, plasmid DNA was isolated, and DNA sequencing 
5 was performed to verify the DNA sequence of the MuOPG 
met [22-401] gene. 

Expression of recombinant muOPG met [22-401] 
polypeptide from cultures of 393 cells harboring 
plasmid pAMG21-MuOPG met [22-401] following induction 
10 was determined using methods described in Section C. 



Oligo #1257-15 

5 ' -TAC GCA CTG GAT CCT TAT AAG CAG CTT ATT TTC ACG 
GAT TGA AC-3' (SEQ ID NO: 65) 

15 

Oligo #1257-16 

5 ' -GTG CTC CTG GTA CCT ACC TAA AAC AGC ACT GCA CAG 
TG-3 ' (SEQ ID NO: 66) 



20 Oligo #1260-61 

5 ' -TAT GGA AAC TCT GCC TCC AAA ATA CCT GCA TTA CGA 
TCC GGA AAC TGG TCA TCA GCT GCT GTG TGA TAA ATG TGC TCC 
GGG TAC-3 • (SEQ ID NO: 67) 



25 Oligo #1260-82 

5 ' -CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CCG GAT CGT AAT GCA GGT ATT TTG GAG GCA GAG TTT 
CCA-3 ' (SEQ ID NO: 68) 



3 0 E. Murine OPG met [32-401] 

A DNA sequence coding for an N-terminal 
methionine and amino acids 32 through 401 of murine OPG 
was placed under control of the luxPR promoter in a 
prolcaryotic plasmid expression vector pAMG21 as 

35 follows. To accomplish this. Synthetic oligos #1267-08 
and #1267-09 were phosphoir/lated individually and 
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allowed to form an oligo linker duplex using methods 
described in Section B. The synthetic linker duplex 
utilized coli codons and provided for an N-terminal 
methionine. The phosphorylated linker duplex formed 
between oligos #1267-08 and #1267-09 containing Ndel 
and Kpnl cohesive ends, and the Kpnl and BamHI digested 
and purified PGR product described earlier (see Section 
D) , was directionally inserted between the Ndel and 
BamHI sites of pAMG21 using standard methodology. The 
ligation mixture was' transformed into coli host 393 
by electroporation utilizing the manufacturer's 
protocol. Clones were selected, plasmid DNA was 
isolated, and DNA sequencing was performed to verify 
the DNA sequence of the muOPG-met [32-401] gene. 

Expression of recombinant muOPG-met [32-401] 
polypeptide from cultures of 393 cells harboring the 
pAMG21 recombinant plasmid following induction was 
determined using methods described in Section C. 

Oligo #1267-08 

5 ' -TAT GGA CCC AGA AAC TGG TCA TCA GOT GCT GTG TGA 
TAA ATG TGC TCC GGG TAC-3 ' (SEQ ID NO: 69) 

Oligo #1267-09 

5 ' -CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CTG GGT CCA-3 ' (SEQ ID NO: 70) 

Murine OPG met-lvs [22-4011 

A DNA sequence coding for an N-terminal 
methionine followed by a lysine residue and amino acids 
22 through 401 of murine OPG was placed under control 
of the lux PR promoter in prokaryotic expression vector 
PAMG21 as follows. Synthetic oligos #1282-95 and 
#12 82-96 were phosphorylated individually and allowed 
to form an oligo linker duplex using methods described 
in Section B, The synthetic linker duplex utilized E^ 
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coli codons and provided for an N-terminal methionine. 
The phosphorylated linker duplex formed between oligos 
#1282-95 and #1282-96 containing Ndel and Kpnl cohesive 
ends and the Kpnl and BamHI digested and purified PGR 
product described in Section D was directionally 
inserted between the Ndel and BamHI sites in pAMG21 
using standard methodology. The ligation mixture was 
transformed into coli host 393 by electroporation 
utilizing the manufacturer's protocol. Clones were 
selected, plasmid DNA was isolated, and DNA sequencing 

was performed to verify the DNA sequence of the MuOPG 

Met-Lys [22-401] gene. 

Expression of recombinant MuOPG Met-Lys [22- 
401] polypeptide from transformed 393 cells harboring 
the recombinant pAMG21 plasmid following induction was 
determined using methods described in Section C. 

Oligo #1282-95 

5 ' -TAT GAA AGA AAC TCT GCC TCC AAA ATA CCT GCA TTA 
CGA TCC GGA AAC TGG TCA TCA GCT GCT GTG TGA TAA ATG TGC 
TCC GGG TAC-3 ' (SEQ ID N0:71) 

Oligo #1282-96 

5 • -CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CCG GAT CGT AAT GCA GGT ATT TTG GAG GCA GAG TTT CTT 
TCA-3 ' (SEQ ID NO: 72) 

G. Murine OPG met-lvs- (his) 7 [22-4011 

A DNA sequence coding for N-terminal residues 
Met-Lys-His-His-His-His-His-His-His (=MKH) followed by 
amino acids 22 through 401 of Murine OPG was placed 
under control of the lux PR promoter in prokaryotic 
expression vector pAMG21 as follows. PCR was performed 
using oligonucleotides #1300-50 and #1257-15 as primers 
and plasmid pAMG21-muOPG-met [22-401] DNA as template, 
Thermocycling conditions were as described in Section 
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B. The resulting PGR sample was resolved on an agarose 
gel, the PGR product was excised, purified, cleaved 
with Ndel and BamHI restriction endonucleases and 
purified. The Ndel and BamHI digested and purified PGR 
product generated using oligo primers #13 00-50 and 
#1257-15 was directionally inserted between the Ndel 
and BamHI sites of pAMG21 using standard DNA 
methodology. The ligation mixture was transformed into 

coli host 393 by electroporation utilizing the 
manufacturer's protocol. Clones were selected, plasmid 
DNA was isolated, and DNA sequencing performed to 
verify the DNA sequence of the muOPG-MKH [22-401] gene. 

Expression of recombinant MuOPG-MKH [22-401] 
polypeptide from transformed 393 cultures harboring the 
recombinant pAMG21 plasmid following induction was 
determined using methods described in Section G. 

Oligo #1300-50 

5 ' -GTT GTG GTG ATA TGA AAG ATG ATG AGG ATG AGG ATG 

ATG AAA GTG TGG GTG GAA AAT AGG TGG ATT AGG AT-3 ' {SEQ 
ID NO: 73} 

Oligo #1257-15 
(see Section D) 

H. Murine OPG met-lvs [22-4011 (his W 

A DNA sequence coding for a N- terminal met- 
lys, amino acids 22 through 401 murine OPG, and seven 
histidine residues following amino acid 401 (=muOPG 
MK [22-401] -H7) , was placed under control of the lux PR 
promoter in prokaryotic expression vector pAMG21 as 
follows. PGR was performed using oligonucleotides 
#13 00-49 and #1300-51 as primers and pAMG21-muOPG 
met [22-401] DNA as template. Thermocycling conditions 
were as described in Section B. The resulting PGR 
sample was resolved on an agarose gel, the PGR product 
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was excised, purified, restricted with Ndel and BamHI 
restriction endonucleases, and purified. The Ndel and 
BamHI digested and purified PGR product was 
directionally inserted between the Ndel and BamHI sites 
5 in pAMG21 using standard methodology. The ligation was 
transformed into E, coli host 393 by electroporation 
utilizing the manufacturer's protocol. Clones were 
selected, plasmid DNA was isolated, and DNA sequencing 
was performed to verify the DNA sequence of the muOPG 
10 MK[22-401] -H7 gene. 

Expression of the recombinant muOPG 
MK- [22-401] -H7 polypeptide from a transformed 393 cells 
harboring the recombinant pAMG21 plasmid following 
induction was determined using methods described in 
15 Section C. 

Oligo #1300-49 

5 ' -GTT CTC CTC ATA TGA AAG AAA CTC TGC CTC CAA AAT 
ACC TGC A-3 ' (SEQ ID NO:74) 

20 

Oligo #1300-51 

5 • -TAG GCA CTG GAT CCT TAA TGA TGG TGA TGG TGA TGA 
TGT AAG CAG CTT ATT TTC ACG GAT TGA ACC TGA TTC CCT A- 
3 ' (SEQ ID NO:75) 

25 

I. Murine QPG met [27-4011 

A DNA sequence coding for a N-terminal 
methionine and amino acids 27 through 401 of murine OPG 
was placed under control of the lux PR promoter of 

30 prokaryotic expression vector pAMG21 as follows, PCR 
was performed with oligonucleotides #13 09-74 and 
#1257-15 as primers and plasmid pAMG21-muOPG-met [22- 
401] DNA as template. Thermocycling conditions were as 
described in Section B. The resulting PCR sample was 

35 resolved on an agarose gel, the PCR product was 
excised, purified, cleaved with Ndel and BamHI 
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restriction endonucleases , and purified. The Ndel and 
BamHI digested and purified PGR product was 
directionally inserted between the Ndel and BamHI sites 
of PAMG21 using standard methodology. The ligation 
mixture was transformed into E, coli host 3 93 by 
electroporation utilizing the manufacturer's protocol. 
Clones were selected, plasmid DNA was isolated, and DNA 
sequencing was performed to verify the DNA sequence of 
the muOPG-met [27-401] gene. 

Expression of recombinant muOPG-met [27-401 ] 
polypeptide from a transfected 3 93 culture harboring 
the recombinant pAMG21 plasmid following induction was 
determined using methods described in Section C, 

Oligo#1309-74 

5*-GTT CTC CTC ATA TGA AAT ACC TGC ATT ACG ATC CGG 
AAA CTG GTC AT-3 ' (SEQ ID NO:76) 

Oligo#1257-15 
(See Section D) 

J. Human QPG met [27-4011 

A DNA sequence coding for a N-terminal 
methionine and amino acids 27 through 401 of human OPG 
was placed under control of the lux PR promoter of 
prokaryotic expression vector pAMG21 as follows. PGR 
was performed using oligonucleotides #13 09-75 and 
#1309-76 as primers and plasmid pAMG21-huOPG-met [22- 
401] DNA as template. Thermocycling conditions were as 
described in Section B. The resulting PGR sample was 
resolved on an agarose gel, the PGR product was 
excised, purified, restricted with Asel and BamHI 
restriction endonuc leases, and purified. The Asel and 
BamHI digested and purified PGR product above was 
directionally inserted between the Ndel and BamHI sites 
of pAMG21 using standard methodology. The ligation 
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mixture was transformed into coli host 393 by 
electroporation utilizing the manufacturer's protocol. 
Clones were selected, plasmid DNA was isolated, and DNA 
sequencing was performed to verify the DNA sequence of 
the huOPG-met [27-401] gene. 

Expression of the recombinant huOPG-met [27- 
401] polypeptide following induction of from 
transfected 3 93 cells harboring the recombinant pAMG21 
plasmid was determined using methods described in 
Section C. 

Oligo #1309-75 

5 ' -GTT CTC CTA TTA ATG AAA TAT CTT CAT TAT GAT GAA 
GAA ACT T-3 ' (SEQ ID NO: 77) 

Oligo #1309-76 

5 ' -TAC GCA CTG GAT CCT TAT AAG CAG CTT ATT TTT ACT 
GAT T-3 ' (SEQ ID NO:78) 

K. Murine OPG met [22-1801 

A DNA sequence coding for a N-terminal 
methionine and amino acids 22 through 180 of murine OPG 
was placed under control of the lux PR promoter of 
prolcaryotic expression vector pAMG21 as follows. PGR 
was performed with oligonucleotides #13 09-72 and #1309- 
73 as primers and plasmid pAMG21-muOPG-met [22-401] DNA 
as template. Thearmocycling conditions were as 
described in Section B. The resulting PGR sample was 
resolved on an agarose gel, the PGR product was 
excised, purified, restricted with Ndel and BamHI 
restriction endonuc leases, and purified. The Ndel and 
BamHI digested and purified PGR product above was 
directionally inserted between the Ndel and BamHI sites 
of PAMG21 using standard methodology. The ligation was 
transfoirmed into E^ coli host 393 by electroporation 
utilizing the manufacturer's protocol. Clones were 



- 90 - 



selected, plasmid DNA was isolated, and DNA sequencing 
was performed to verify the DNA sequence of the muOPG- 
met [22-180] gene. 

Expression of recombinant muOPG-met [22-180] 
polypeptide from transformed 3 93 cultures harboring the 
recombinant pAMG21 plasmid following induction was 
determined using methods described in Section C. 

Oligo #1309-72 

5 ' -GTT CTC CTC ATA TGG AAA CTC TGC CTC CAA AAT ACC 
TGC A-3 ' (SEQ ID NO: 79) 

Oligo #1309-73 

5 ' -TAG GCA CTG GAT OCT TAT GTT GCA TTT CCT TTC TGA 
ATT AGO A-3' (SEQ ID NO:80) 

L. Murine OPG met r 27-1801 

A DNA sequence coding for a N-teiiminal 
methionine and amino acids 27 through 180 of murine OPG 
was placed under the control of the lux PR promoter of 
proJcaryotic expression vector pAMG21 as follows. PGR 
was performed using oligonucleotides #1309-74 (see 
Section I) and #13 09-73 (see Section K) as primers and 
plasmid pAMG21-muOPG met [22-401] DNA as template. 
Thermocycling conditions were as described in Section 
B. The resulting PGR sample was resolved on an agarose 
gel, the PGR product excised, purified, restricted with 
Ndel and BamHI restriction endonucleases , and purified. 
The Ndel and BamHI digested and purified PGR product 
above was directionally inserted between the Ndel and 
BamHI sites in pAMG21 using standard methodology. The 
ligation mixture was transformed into E^ coli host 3 93 
by electroporation utilizing the manufacturer's 
protocol. Clones were selected, plasmid DNA was 
isolated, and DNA sequencing was performed to verify 
the DNA sequence of the muOPG met [27-180] gene. 



Expression of recombinant muOPG met [27-180] 
polypeptide from cultures of transformed 393 cells 
harboring the recombinant pAMG21 plasmid following 
induction was determined using methods described in 
Section C. 

M. Muri ne OPG met [22-1891 and metr22-194] 

A DNA sequence coding for a N-terminal 
methionine and either amino acids 22 through 189, or 22 
through 194 of murine OPG was placed under control of 
the lux PR promoter of prokaryotic expression vector 
PAMG21 as follows. The pair of synthetic 
oligonucleotides #1337-92 and #1337-93 (=muOPG-189 
linker) or #1333-57 and #1333-58 (=muOPG-194 linker) 
were phosphorylated individually and allowed to form an 
oligo linker duplex pair using methods described in 
Section B. Purified plasmid DNA of pAMG21-muOPG- 
met [22-401] was cleaved with Kpnl and BspEI restriction 
endonucleases and the resulting DNA fragments were 
resolved on an agarose gel. The ^413 bp B fragment was 
isolated using standard recombinant DNA methodology. 
The phosphorylated oligo linker duplexes formed between 
either oligos #1337-92 and #1337-93 (muOPG-189 linker) 
or oligos #1333-57 and #1333-58 (muOPG-194 linker) 
containing BspEI and BamHI cohesive ends, and the 
isolated -413 bp B fragment of plasmid pAMG21-muOPG- 
met [22-401] digested with Kpnl and BspEI restriction 
endonucleases above, was directional ly inserted between 
the Kpnl and BamHI sites of pAMG21-muOPG met [22-401] 
using standard methodology. Each ligation mixture was 
transformed into E^ coli host 393 by electroporation 
utilizing the manufacturer's protocol. Clones were 
selected, plasmid DNA was isolated, and DNA sequencing 
was performed to verify the DNA sequence of either the 
muOPG-met [22-189] or muOPG-met [22-194] genes. 
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Expression of recombinant muOPG-met [22 -189 ] 
and muOPG-met [22-194] polypeptides from recombinant 
pAMG21 plasmids transformed into 3 93 cells was 
determined using methods described in Section C. 

5 

Oligo #1337-92 

5'-CCG GAA ACA GAT AAT GAG-3 ' (SEQ ID NO: 81) 

Oligo #1337-93 
10 5^ -GAT OCT CAT TAT CTG TTT-3 ' (SEQ ID NO: 82) 

Oligo #1333-57 

5 • -CCG GAA ACA GAG AAG CCA CGC AAA AGT AAG-3 ' 
(SEQ ID NO: 83) 

15 

Oligo #1333-58 

5 » -GAT CCT TAC TTT TGC GTG GCT TCT CTG TTT-3 ' 
(SEQ ID NO: 84) 

20 N, Murine OPG met [27-1891 and met [27-1941 

A DNA sequence coding for a N-terminal 
methionine and either amino acids 27 through 189, or 27 
through 194 of murine OPG was placed under control of 
the lux PR promoter of prokaryotic expression vector 

25 PAMG21 as follows. Phosphorylated oligo linkers either 
"muOPG-189 linker" or "muOPG-194 linker" (see Section 
M) containing BspEI and BamHI cohesive ends, and the 
isolated -413 bp B fragment of plasmid pAMG21-muOPG- 
met [22-401] digested with Kpnl and BspEI restriction 

3 0 endonucleases were directionally inserted between the 
Kpnl and BamHI sites of plasmid pAMG21-muOPG-met [27- 
401] using standard methodology. Each ligation was 
transformed into E. coli host 393 by electroporation 
utilizing the manufacturer's protocol. Clones were 

3 5 selected, plasmid DNA was isolated, and DNA sequencing 



A 
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was performed to verify the DNA sequence of either the 
muOPG met [27-189] or muOPG met [27-194] genes. 

Expression of recombinant muOPG met [27-189] 
and muOPG met [27-194] following induction of 393 cells 
5 harboring recombinant pAMG21 plasmids was determined 
using methods described in Section C, 

O. Human OPG met [22-1851 , metr22-1891, metr22-1941 
A DNA sequence coding for a N-terminal 

10 methionine and either amino acids 22 through 185, 22 
through 189, or 22 through 194 of the human OPG 
polypeptide was placed under control of the lux PR 
promoter of pro]<aryotic expression vector pAMG21 as 
follows. The pair of synthetic oligonucleotides #1331- 

15 87 and #1331-88 (=huOPG-185 linlcer) , #1331-89 and 

#1331-90 (=huOPG-189 linlcer) , or #1331-91 & #1331-92 
(=huOPG-194 linJcer) were phosphorylated individually 
and each allowed to form an oligo linJcer duplex pair 
using methods described in Section B. Purified plasmid 

20 DNA of pAMG21-huOPG-met [27-401] was restricted with 
Kpnl and Ndel restriction endonucleases and the 
resulting DNA fragments were resolved on an agarose 
gel. The -407 bp B fragment was isolated using 
standard recombinant DNA methodology. The 

25 phophorylated oligo linlcer duplexes formed between 

either oligos #1331-87 and #1331-88 (huOPG-185 linker) , 
oligos #1331-89 and #1331-90 (huOPG-189 linker) , or 
oligos #1331-91 and #1331-92 (huOPG-194 linker) [each 
linker contains Ndel and BamHI cohesive ends] , and the 

30 isolated --407 bp B fragment of plasmid pAMG21-huOPG- 
met [27-401] digested with Kpnl and Ndel restriction 
endonucleases above, was directionally inserted between 
the Kpnl and BamHI sites of plasmid pAMG21-huOPG- 
met [22-401] using standard methodology. Each ligation 

3 5 was transformed into coli host 393 by 

electroporation utilizing the manufacturer's protocol. 
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Clones were selected, plasmid DNA was isolated, and DNA 
sequencing was performed to verify the DNA sequence of 
either the huOPG-met [22-185] , huOPG-met [22-189] , or 
huOPG-met [22-194 ] genes . 

Expression of recombinant huOPG-met [22-185] , 
huOPG-met [22-189] or huOPG-met [22-194] in transformed 
3 93 cells harboring recombinant pAMG21 plasmids 
following induction was determined using methods 
described in Section C. 

Oligo #1331-87 

5' -TAT GTT AAT GAG-3 ' (SEQ ID NO: 85) 



Oligo #1331-88 
15 5' -GAT CCT CAT TAA CA-3 ' (SEQ ID NO: 86) 

Oligo #1331-89 

5* -TAT GTT CCG GAA ACA GTT AAG-3 ' (SEQ ID NO: 87) 

20 Oligo #1331-90 

5' -GAT CCT TAA CTG TTT CCG GAA (:A-3 ' (SEQ ID 
NO: 88) 



Oligo #1331-91 
25 5* -TAT GTT CCG GAA ACA GTG AAT CAA CTC AAA AAT 

AAG-3' (SEQ ID NO: 89) 

Oligo #1331-92 

5 ' -GAT CCT TAT TTT TGA GTT GAT TCA CTG TTT CCG GAA 
3 0 CA-3' (SEQ ID NO: 90) 



P. Human OPG met [27-1851 . metr27-1891, met [27-1941 
A DNA sequence coding for a N-terminal 
35 methionine and either amino acids 27 through 185, 27 
through 189, or 27 through 194 of the human OPG 



polypeptide was placed under control of the lux PR 
promoter of prokaryotic expression vector pAMG21 as 
follows. Phosphorylated oligo linkers "huOPG-185 
linker", "huOPG-189 linker", or "huOPG-194 linker" (See 
Section O) each containing Ndel and BamHI cohesive 
ends, and the isolated -407 bp B fragment of plasmid 
pAMG21-huOPG-met [27-401] digested with Kpnl and Ndel 
restriction endonucleases (See Section O) were 
directionally inserted between the Kpnl and BamHI sites 
of plasmid pAMG21-huOPG-met [27-401] (See Section J) 
using standard methodology. Each ligation was 
transformed into coli host 393 by electroporation 
utilizing the manufacturer's protocol. Clones were 
selected, plasmid DNA isolated, and DNA sequencing 
performed to verify the DNA sequence of either the 
huOPG-met [27-185] , huOPG-met [27-189] , or huOPG-met[27- 
194] genes. 

Expression of recombinant huOPG-met [27-185 ] , 
huOPG-met [27-189] , and huOPG-met [27-194] from 
recombinant pAMG21 plasmids transformed into 393 cells 
was determined using methods described in Section C. 

0. Muri ne OPG met [27-4011 (P33E. G36S, A45P) 

A DNA sequence coding for an N- terminal 
methionine and amino acids 27 through 48 of human OPG 
followed by amino acid residues 49 through 401 of 
murine OPG was placed under control of the lux PR 
promoter of prokaryotic expression vector pAMG21 as 
follows. Purified plasmid DNA of pAMG21-huOPG-met [27- 
401] (See Section J) was cleaved with Aatll and Kpnl 
restriction endonucleases and a -1075 bp B fragment 
isolated from an agarose gel using standard recombinant 
DNA methodology. Additionally, plasmid pAMG21-muOPG- 
met [22-4011 DNA (See Section D) was digested with Kpnl 
and BamHI restriction endonucleases and the -1064 bp B 
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fragment isolated as described above. The isolated 
--1075 bp pAMG21-huOPG-met [27-401] restriction fragment 
containing Aatll & Kpnl cohesive ends (see above) , the 
--1064 bp pAMG21-muOPG-met [22-401] restriction fragment 
5 containing Kpnl and BamHI sticlcy ends and a -5043 bp 
restriction fragment containing Aatll and BamHI 
cohesive ends and corresponding to the nucleic acid 
sequence of pAMG21 between Aatll Sc BamHI were ligated 
using standard recombinant DNA methodology. The 

10 ligation was transformed into E_:_ coli host 393 by 

electroporation utilizing the manufacturer's protocol. 
Clones were selected, and the presence of the 
recombinant insert in the plasmid verified using 
standard DNA methodology. muOPG-27-401 (P33E, G36S, 

15 A45P) gene. Amino acid changes in muOPG from proline- 
33 to glutamic acid-33, glycine-36 to serine-3 6, and 
alanine-45 to proline-45, result from replacement of 
muOPG residues 27 through 48 with huOPG residues 27 
through 48. 

20 Expression of recombinant muOPG-met [27-401] 

(P3 3E, G3 6S, A45P) from transformed 393 cells harboring 
the recombinant pAMG21 plasmid was determined using 
methods described in Section C, 

25 

R. Murine QPG met-lvs- (his) 7 -ala-ser- (asp)^-lvs [22-4011 
(A45T) 

A DNA sequence coding for an N-terminal His 
tag and enterolcinase recognition sequence which is (NH2 

30 to COOH terminus) : Met-Lys-His-His-His-His-His-His- 
His-Ala-Ser-Asp-Asp-Asp-Asp-Lys (=HEK) , followed by 
amino acids 22 through 401 of the murine OPG 
polypeptide was placed under control of the lac 
repressor regulated Ps4 promoter as follows, pAMG22- 

35 His (See Section A) was digested with Nhel and BamHI 

restriction endonuc leases , and the large fragment (the 
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A fragment) isolated from an agarose gel using standard 
recombinant DNA methodology. Oligonucleotides #1282-91 
and #1282-92 were phosphorylated individually and 
allowed to form an oligo linker duplex using methods 
5 previously described (See Section B) . The 

phosphorylated linker duplex formed between oligos 
#1282-91 and #1282-92 containing Nhel and Kpnl cohesive 
ends, the Kpnl and BamHI digested and purified PGR 
product- described (see Section D) , and the A fragment 

10 of vector pAMG22-His digested with Nhel and BamHI were 
ligated using standard recombinant DNA methodology. 
The ligation was transformed into E^. coli host GM120 by 
electroporation utilizing the manufacturer's protocol. 
Clones were selected, plasmid DNA isolated and DNA 

15 sequencing performed to verify the DNA sequence of the 
muOPG-HEK[22-401] gene, DNA sequencing revealed a 
spurious mutation in the natural muOPG sequence that 
resulted in a single amino- acid change of Alanine-45 of 
muOPG polypeptide to a Threonine. 

2 0 Expression of recombinant muOPG-HEK [22-401] 

(A45T) from GM120 cells harboring the recombinant 
pAMG21 plasmid was deteirmined using methods similar to 
those described in Section C, except instead of 
addition of the synthetic autoinducer, IPTG was added 

25 to 0.4 mM final to achieve induction. 



Oligo #1282-91 

5'-CTA GCG ACG ACG ACG 
AAT ACC TGC ATT ACG ATC CGG 
3 0 GTG ATA AAT GTG CTC CGG GTA 



ACA AAG AAA CTC TGC CTC CAA 
AAA CTG GTC ATC AGC TGC TGT 
C-3 * (SEQ ID N0:91) 



Oligo #1282-92 

5 ' -CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CCG GAT CGT AAT GCA GGT ATT TTG GAG GCA GAG TTT CTT 
35 TGT CGT CGT CGT CG-3 ' (SEQ ID NO:92) 



- 98 - 



S> Human QP G itiet-arg-glv-ser- (his)£r22-4011 

Eight oligonucleotides (1338-09 to 1338-16 
shown below) were designed to produce a 175 base 
fragment as overlapping, double stranded DNA. The 
5 oligos were annealed, ligated, and the 5' and 3' oligos 
were used as PGR primers to produce large quantities of 
the 175 base fragment. The final PGR gene products 
were digested with restriction endonucleases Glal and 
Kpnl to yield a fragment which replaces the N-terminal 

10 28 codons of human OPG. The Clal and Kpnl digested PGR 
product was inserted into pAMG21-huOPG [27-401] which 
had also been cleaved with Glal and Kpnl.- Ligated DNA 
was transformed into competent host cells of E. coli 
strain 393, Glones were screened for the ability to 

15 produce the recombinant protein product and to possess 
the gene fusion having the correct nucleotide sequence. 
Protein expression levels were determined from 50 ml 
shaker flask studies. Whole cell lysate and sonic 
pellet were analyzed for expression of the construct by 

20 Goomassie stained PAGE gels and Western analysis with 

murine anti-OPG antibody. Expression of huOPG Met-Arg- 
Gly-Ser- (His) 6 [22-401] resulting in the formation of 
large inclusion bodies and the protein was localized to 
the insoluble (pellet) fraction. 

25 

1338-09 

ACA AAC ACA ATC GAT TTG ATA OTA OA ( SEQ ID NO: 93) 
1338-10 

3 0 TTT GTT TTA ACT AAT TAA AGG AGG AAT AAA ATA TGA GAG GAT CGC ATC AC 
(SEQ ID NO: 94) 

1338-11 

CAT CAC CAT CAC GAA ACC TTC CCG CCG AAA TAG CTG CAC TAG GAC GAA. GA 
35 (SEQ ID NO:95) 
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1338-12 

AAC CTC CCA CCA GCT GCT GTG CGA CAA ATG CCC GCC GGG TAG CCA AAC A 
(SEQ ID NO: 96) 

5 

1338-13 

TGT TTG GGT ACC CGG CGG GCA TTT GT (SEQ ID NO: 97) 
1338-14 

10 CGC ACA GCA GCT GGT GGG AGG TTT CTT CGT CGT AGT GCA GGT ATT TCG GC 
(SEQ ID NO: 98) 

1338-15 

GGG AAG GTT TCG TGA TGG TGA TGG TGA TGC GAT CCT CTC ATA TTT TAT T 
15 (SEQ ID NO: 99) 

1338-16 

CCT CCT TTA ATT AGT TAA AAC AAA TCT AGT ATC AAA TCG ATT GTG TTT GT 
(SEQ ID NO: 100) 

20 

T. Human OPG itiet-lvs f 22-4011 and met'(lvs)2l"22-40n 

To construct the met-lys and met-(lys)3 
versions of human OPG[22-401] , overlapping 
oligonucleotides were designed to add the appropriate 

25 number of lysine residues. The two oligos for each 

construct were designed to overlap, allowing two rounds 
of PGR to produce the final product. The template for 
the first PGR reaction was a plasmid DNA preparation 
containing the human OPG 22-401 gene. The first PGR 

3 0 added the lysine residue (s) . The second PGR used the 
product of the first round and added sequence back to 
the first restriction site, Glal . 

The final PGR gene products were digested 
with restriction endonucleases Clal and Kpnl, which 

3 5 replace the N- terminal 28 codpns of hu OPG, and then 
ligated into plasmid pAMG21-hu OPG [27-401] which had 
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been also digested with the two restriction 
endonucleases . Ligated DNA was transformed into 
competent host cells of coli strain 393. Clones 
were screened for the ability to produce the 
5 recombinant protein product and to possess the gene 

fusion having the correct nucleotide sequence. Protein 
expression levels were determined from 50 ml shaker 
flask studies. Whole cell lysate and sonic pellet were 
analyzed for expression of the construct by Coomassie 
10 stained PAGE gels and Western analysis with murine 

anti-OPG antibody. Neither construct had a detectable 
level of protein expression and inclusion bodies were 
not visible. The DNA sequences were confirmed by DNA 
sequencing. 

15 

Oligonucleotide primers to prepare Met-Lys huOPG[22- 

401] : 

1338-17 

ACA AAC ACA ATC GAT TTG ATA CTA GAT TTG TTT TAA CTA ATT 
20 AAA GGA GGA ATA AAA TG (SEQ ID NO: 101) 

1338-18 

CTA ATT AAA GGA GGA ATA AAA TGA AAG AAA CTT TTC CTC CAA 
AAT ATC (SEQ ID NO: 102) 

25 

1338-20 

TGT TTG GGT ACC CGG CGG ACA TTT ATC ACA C (SEQ ID 
NO:103) 

30 Oligonucleotide primers to prepare Met- (Lys) 3-huOPG [22- 
401] : 
1338-17 

ACA AAC ACA ATC GAT TTG ATA CTA GAT TTG TTT TAA CTA ATT 
AAA GGA GGA ATA AAA TG (SEQ ID NO: 104) 



35 
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1338-19 

CTA ATT AAA GGA GGA ATA AAA TGA AAA AAA AAG AAA CTT TTC 
CTC CAA AAT ATC (SEQ ID NO: 105) 

5 1338-20 

TGT TTG GGT ACC CGG CGG ACA TTT ATC ACA C (SEQ ID 
NO:106) 

U. Human and Murine OPG [22-4011 /Fc Fusions 
10 Four OPG-Fc fusions were constructed where 

the Fc region of human IgGl was fused at the N-tenninus 
of either human or murine Osteoprotegerin amino acids 
22 to 401 (referred to as Fc/OPG [22-401]) or at the C- 
terminus (referred to as OPG [22-401] /Fc) , Fc fusions 
15 were constructed using the fusion vector pFc-A3 
described in Example 7 . 

All fusion genes were constructed using 
standard PGR technology. Template for PGR reactions 
were plasmid preparations containing the target genes. 

2 0 Overlapping oligos were designed to combine the 

C-terminal portion of one gene with the N terminal 
portion of the other gene. This process allows fusing 
the two genes together in the correct reading frame 
after the appropriate PGR reactions have been 
25 performed. Initially one "fusion" oligo for each gene 
was put into a PGR reaction with a universal primer for 
the vector carrying the target gene. The complimentary 
"fusion" oligo was used with a universal primer to PGR 
the other gene. At the^ end of this first PGR reaction, 

3 0 two separate products were obtained, with each 

individual gene having the fusion site present, 
creating enough overlap to drive the second round of 
PGR and create the desired fusion. In the second round 
of PGR, the first two PGR products were combined along 
35 with universal primers and via the overlapping regions, 
the full length fusion DNA sequence was produced. 
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The final PGR gene products were digested 
with restriction endonucleases Xbal and BamHI, and then 
ligated into the vector pAMG21 having been also 
digested with the two restriction endonucleases. 
5 Ligated DNA was transformed into competent host cells 
of E. coli strain 393. Clones were screened for the 
ability to produce the recombinant protein product and 
to possess the gene fusion having the correct 
nucleotide sequence. Protein expression levels were 
10 determined from 50 ml shaker flask studies. Whole cell 
lysate, sonic pellet, and supernatant were analyzed for 
expression of the fusion by Coomassie stained PAGE gels 
and Western analysis with murine anti-OPG antibody, 

15 Fc/huOPG r22-40n 

Expression of the Fc/hu OPG [22-401] fusion 
peptide was detected on a Coomassie stained PAGE gel 
and on a Western blot. The cells have very large 
inclusion bodies, and the majority of the product is in 

2 0 the insoluble (pellet) fraction. The following primers 

were used to construct this OPG-Fc fusion: 

1318-48 

25 CAG CCC GGG-TAA AAT GGA AAC GTT TCC TCC AAA ATA TCT TCA 
TT (SEQ ID NO: 107) 

1318-49 

CGT TTC CAT TTT ACC CGG GCT GAG CGA GAG GCT CTT CTG CGT 

3 0 GT (SEQ ID NO: 108) 

Fc/muOPG [22-4011 

Expression of the fusion peptide was detected 
on a Coomassie stained gel and on a Western blot. The 
3 5 cells have very large inclusion bodies, and the 

majority of the product is in the insoluble (pellet) 
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fraction. The following primers were used to construct 
this OPG-Fc fusion: 

1318-50 

5 CGC TCA GCC CGG GTA AAA TGG AAA CGT TGC CTC CAA AAT ACC 
TGC (SEQ ID NO: 109) 

1318-51 

CCA TTT TAG CCG GGC TGA GCG AGA GGC TCT TCT GCG TGT 
10 (SEQ ID NO: 110) 

muOPG r22-4011/Fc 

Expression of the fusion peptide was detected 
on a Coomassie stained gel and on a Western blot. The 

15 amount of recombinant product was less than the OPG 

fusion proteins having the Fc region in the N terminal 
position. Obvious inclusion bodies were not detected. 
Most of the product appeared to be in the insoluble 
(pellet) fraction. The following primers were used to 

20 construct this OPG-Fc fusion: 

1318-54 

GAA AAT AAG CTG CTT AGC TGC AGC TGA ACC AAA ATC 
(SEQ ID N0:111) 

25 

1318-55 

CAG CTG CAG CTA AGC AGC TTA TTT TCA CGG ATT G 
(SEQ ID NO: 112) 

30 

huOPG [22-4011 /Fc 

Expression of the fusion peptide was not 
detected on a Coomassie stained gel, although a faint 
Western positive signal was present. Obvious inclusion 
3 5 bodies were not detected. The following primers were 
used to prepare this OPG-Fc fusion: 
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1318-52 

AAA AAT AAG CTG CTT AGC TGC AGC TGA ACC AAA ATC 
(SEQ ID NO: 113) 

5 

1318-53 

GAG CTG GAG CTA AGC AGC TTA TTT TTA CTG ATT GG 
(SEQ ID NO: 114) 

10 V. Human OPG met [22-4011 -Fc fusion (P25A) 

This construct combines a proline to alanine 
amino acid change at position 25 (P25A) with the huOPG 
met [22-401] -Fc fusion. The plasmid was digested with 
restriction endonucleases Clal and Kpnl, which removes 

15 the N- terminal 28 codons of the gene, and the resulting 
small (less than 2 00 base pair) fragment was gel 
purified. This fragment containing the proline to 
alanine change was then ligated into plasmid pAMG21- 
huOPG [22-401] -Fc fusion which had been digested with 

20 the two restriction endonucleases. The ligated DNA was 
transformed into competent host cells of E^ coli strain 
3 93. Clones were screened for the ability to produce 
the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. Protein 

25 expression levels were determined from 50 ml shaker 

flaslc studies. Whole cell lysate and sonic pellet were 
analyzed for expression of the construct by Coomassie 
stained PAGE gels and Western analysis with murine 
anti-OPG antibody. The expression level of the fusion 

3 0 peptide was detected on a Coomassie stained PAGE gel 
and on a Western blot. The protein was in the 
insoluble (pellet) fraction. The cells had large 
inclusion bodies. 
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W. Human QPG met [22-4011 (P25A) 

A DNA sequence coding for an N- terminal 
methionine and amino acids 22 through 401 of human OPG 
with the proline at position 25 being substituted by 
5 alanine under control of the lux Pr promoter in 

prokairyotic expression vector pAMG21 was constructed as 
follows: Synthetic oligos # 1289-84 and 1289-85 were 
annealed to form an oligo linker duplex with Xbal and 
Kpnl cohesive ends. The synthetic linker duplex 

10 utilized optimal E. coli codons and encoded an N- 

terminal methionine. The linker also included an Spel 
restriction site which was not present in the original 
sequence. The linker duplex was directionally inserted 
between the Xbal and Kpnl sites in pAMG21-huOPG-22-401 

15 using standard methods. The ligation mixture was 

introduced into E. coli host GM221 by transformation. 
Clones were initially screened for production of the 
recombinant protein. Plasmid DNA was isolated from 
positive clones and DNA sequencing was performed to 

20 verify the DNA sequence of the HuOPG-Met [22-401] (P25A) 
gene. The following oligonucleotides were used to 
generate the Xbal - Kpnl linker: 

Oligo #1289-84 

25 5*-CTA GAA GGA GGA ATA AC A TAT GGA AAC TTT TGC TCC 

AAA ATA TCT TCA TTA TGA TGA AGA AAC TAG TCA TCA GCT GCT 
GTG TGA TAA ATG TCC GCC GGG TAC -3^ (SEQ ID NO: 115) 

30 Oligo #1289-85 

5 ' - CCG GCG GAC ATT TAT CAC ACA GCA GCT GAT GAC 
TAG TTT CTT CAT CAT AAT GAA GAT ATT TTG GAG CAA AAG TTT 
CCA TAT GTT ATT CCT CCT T-3 ' (SEQ ID NO: 116) 



- 106 - 



2L Human QP G met [22-4011 (P26A) and (P26D) 

A DNA sequence coding for an N-terminal 
methionine and amino acids 22 through 401 of human OPG 
with the proline at position 26 being substituted by 
5 alanine under control of the lux Pr promoter in 

prokaryotic expression vector pAMG21 was constructed as 
follows: Synthetic oligos # 1289-86 and 1289-87 were 
annealed to form an oligo linker duplex with Xbal and 
Spel cohesive ends. The synthetic linker duplex 

10 utilized optimal E. coli codons and encoded an N- 
terminal methionine. The linker duplex was 
directionally inserted between the Xbal and Spel sites 
in pAMG21-huOPG [22-401] (P25A) using standard methods. 
The ligation mixture was introduced into E. coli host 

15 GM221 by transformation. Clones were initially 

screened for production of the recombinant protein. 
Plasmid DNA was isolated from positive clones and DNA 
sequencing was performed to verify the DNA sequence of 
the huOPG-met [22-401] (P26A) gene. One of the clones 

2 0 sequenced was found to have the proline at position 26 
substituted by aspartic acid rather than alanine, and 
this clone was designated huOPG-met [22-401] (P26D) . The 
following oligonucleotides were used to generate the 
Xbal - Spel linker: 

25 

Oligo #1289-86 

5 ' - CTA GAA GGA GGA ATA ACA TAT GGA AAC TTT TCC 
TGC TAA ATA TCT TCA TTA TGA TGA AGA AA - 3 ' 
(SEQ ID N0:117) 

30 

Oligo #1289-87 



35 



5 • - CTA GTT TCT TCA TCA TAA TGA AGA TAT TTA GCA 
GGA AAA GTT TCC ATA TGT TAT TCC TCC TT - 3 ' 
(SEQ ID NO: 118) 
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Y > Human OPG met [22-1941 (P25A) 

A DNA sequence coding for an N-terminal 
methionine and amino acids 22 through 194 of human OPG 
5 with the proline at position 25 being substituted by 
alanine under control of the lux Pr promoter in 
prokaryotic expression vector pAMG21 was constructed as 
follows; The plasmids pAMG21-huOPG [27-194] and pAMG21- 
huOPGC22-401] (P25A) were each digested with Kpnl and 

10 BamHI endonucleases . The 450 bp fragment was isolated 
from pAMG21-huOPG[ 27-194] and the 6,1 kbp fragment was 
isolated from pAMG21-huOPG [22-401] (P25A) , These 
fragments were ligated together and introduced into E. 
coli host GM221 by transformation. Clones were 

15 initially screened for production of the recombinant 

protein. Plasmid DNA was isolated from positive clones 
and DNA sequencing was performed to verify the DNA 
sequence of the huOPG-Met [22-194] (P25A) gene. 

20 EXAMPLE 9 

Association of OPG Monomers 

CHO cells engineered to overexpress muOPG 
[22-401] were used to generate conditioned media for 

25 the analysis of secreted recombinant OPG using rabbit 
polyclonal anti-OPG antibodies. An aliquot of 
conditioned media was concentrated 2 0-fold, then 
analysed by reducing and non-reducing SDS-PAGE 
(Figure 15) . Under reducing conditions, the protein 

3 0 migrated as a Mr 50-55 kd polypeptide, as would be 

predicted if the mature product was glycosylated at one 
or more of its consensus N-linked glycosylation sites. 
Suprisingly, when the same samples were analysed by 
non-reducing SDS-PAGE, the majority of the protein 

3 5 migrated as an approximately 100 kd polypeptide, twice 
the size of the reduced protein. In addition, there 
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was a smaller amount of the Mr 50-55 kd polypeptide. 
This pattern of migration on SDS-PAGE was consistent 
with the notion that the OPG product was forming dimers 
through oxidation of a free sulfhydryl group (s). 
5 The predicted mature OPG polypeptide contains 

23 cysteine residues, 18 of which are predicted to be 
involved in forming intrachain disulfide bridges which 
comprise the four cysteine-rich domains (Figure 12A) . 
The five remaining C-terminal cysteine residues are not 

10 involved in secondary structure which can be predicted 
based upon homology with other TNFR family members. 
Overall there is a net uneven number of cysteine 
residues, and it is formally possible that at least one 
residue is free to form an intermolecular disulfide 

15 bond between two OPG monomers. 

To help elucidate patterns of OPG kinesis and 
monomer association, a pulse-chase labelling study was 
performed* CHO cells expressing muOPG [22-401] were 
metabolically labelled as described above in serum- free 

2 0 mediiim containing ^^S methionine and cysteine for 3 0 

min. After this period, the media was removed, and 
replaced with complete mediiam containing unlabelled 
methionine and cysteine at levels approximately 2,000- 
fold excess to the original concentration of 
25 radioactive amino acids. At 30 min, Ihr, 2 hr, 4 hr, 5 
hr and 12 hr post addition, cultures were harvested by 
the removal of the conditioned media, and lysates of 
the conditioned media and adherent monolayers were 
prepared. The culture media and cell lysates were 

3 0 clarified as described above, and then 

immunoprecipitated using anti-OPG antibodies as 
described above. After the immunoprecipitates were 
washed, they were released by boiling in non-reducing 
SDS-PAGE buffer then split into two equal halves. To 
3 5 one half, the reducing agent p -mere apt othanol was added 
to 5% (v/v) final concentration, while the other half 
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was maintained in non-reducing conditions. Both sets of 
immunoprecipitates were analysed by SDS-PAGE as 
described above, then processed for autoradiography and 
exposed to film. The results are shown in Figure 16. 
5 The samples analysed by reducing SDS-PAGE are depicted 
in the bottom two panels. After synthesis, the OPG 
polypeptide is rapidly processed to a slightly larger 
polypeptide, which probably represents modification by 
N-linked glycoslyation . After approximately 1-2 hours, 

10 the level of OPG in the cell decreases dramatically, 
and concomitantly appears in the culture supernatant. 
This appears to be the result of the vectoral transport 
of OPG from the cell into the media over time, 
consistent with the notion that OPG is a naturally 

15 secreted protein. Analysis of the same 

immunoprecipitates under nonreducing conditions reveals 
the relationship between the formation of OPG dimers 
and secretion into the conditioned media (Figure 16, 
upper panels) . In the first 3 0-60 minutes, OPG 

20 monomers are processed in the cell by apparent 

glycoslylation, followed by dimer formation. Over 
time, the bulk of OPG monomers are driven into dimers, 
which subsequently disappear from the cell. Beginning 
about 60 minutes after synthesis, OPG dimers appear in 

25 the conditioned media, and accumulate over the duration 
of the experiment. Following this period, OPG dimers 
are formed, which are then secreted into the culture 
media. OPG monomers persist at a low level inside the 
cell over time, and small amounts also appear in the 

3 0 media. This does not appear to be the result of 
breakdown of covalent OPG dimers, but rather the 
production of sub-stoichiometric amounts of monomers in 
the cell and subsequent secretion. 

Recombinantly produced OPG from transfected 

35 CHO cells appears to be predominantly a dimer. To 

determine if dimerization is a natural process in OPG 
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synthesis, we analysed the conditioned media of a cell 
line found to naturally express OPG, The CTLL-2 cell 
line, a murine cytotoxic T lymphocytic cell line {ATCC 
accession no. TIB-214) , was found to express OPG mRNA 
5 in a screen of tissue and cell line RNA. The OPG 

transcript was found to be the same as the cloned and 
sequenced 2.5-3.0 kb RNA identified from kidney and 
found to encode a secreted molecule. Western blot 
analysis of conditioned media obtained from CTLL-2 
10 cells shows that most, if not all, of the OPG protein 
secreted is a dimer (Figure 17), This suggests that 
OPG dimerization and secretion is not an artifact of 
overexpression in a cell line, but is likely to be the 
main form of the product as it is produced by 
15 expressing cells. 

Normal and transgenic mouse tissues and seriam 
were analysed to determine the nature of the OPG 
molecule expressed in OPG transgenic mice. Since the 
rat OPG cDNA was expressed under the control of a 
2 0 hepatocyte control element, extracts made from the 

parenchyma of control and transgenic mice under non- 
reducing conditions were analysed (Figure 18) . In 
extract from transgenic, but not control mice, OPG 
dimers are readily detected, along with 
25 substoichiometric amounts of monomers. The OPG dimers 
and monomers appear identical to the recombinant murine 
protein expressed in the genetically engineered CHO 
cells. This strongly suggests that OPG dimers are 
indeed a natural form of the gene product, and are 
30 likely to be key active components. Serum samples 
obtained from control and transgenic mice were 
similarly analysed by western blot analysis. In 
control mice, the majority of OPG protein migrates as a 
dimer, while small amounts of monomer are also 
35 detected. * In addition, significant amounts of a larger 
OPG related protein is detected, which migrates with a 
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relative molecular mass consistent with the predicted 
size of a covalently-linked trimer. Thus, recombinant 
OPG is expressed predominantly as a dimeric protein in 
OPG transgenic mice, and the dimer form may be the 
basis for the osteopetrotic phenotype in OPG mice. OPG 
recombinant protein may also exist in higher molecular 
weight "trimeric" forms. 

To determine if the five C-terminal cysteine 
residues of OPG play a role in homodimerization, the 
murine OPG codons for cytsteine residues 195 (C195) , 
C202, C277, C319, and C400 were changed to serine using 
the QuickChange™ Site-Directed Mutagenesis Kit 
(Stratagene, San Diego, CA) as described above. The 
muOPG gene was subcloned between the Not I and Xba I 
sites of the pcDNA 3.1 (+) vector (Invitrogen, San 
Diego, CA) . The resulting plasmid, pcDNAS . 1-muOPG, and 
mutagenic primers were treated with Pfu polymerase in 
the presence of deoxynucleo tides, then amplified in a 
thermocycler as described above. An aliqout of the 
reaction is then transfected into competent E, coli 
XLl-Blue by heat shock, then plated. Plasmid DNA from 
trans foxmants was then sequenced to verify mutations. 

The following primer pairs were used to 
change the codon for cysteine residue 195 to serine of 
the murine OPG gene, resulting in the production of a 
muOPG [22-401] C195S protein: 

1389-19: 

5' -CAC GCA AAA GTC GGG AAT AGA TGT CAC-3 ' 
(SEQ ID NO:150) 

1406-38: 

5' -GTG ACA TCT ATT CCC GAG TTT TGC GTG-3 ' 
(SEQ ID NO: 151) 
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The following primer pairs were used to 
change the codon for cysteine residue 202 to serine of 
the murine OPG gene, resulting in the production of a 
5 muOPG [22-401] C202S protein: 

1389-21: 

5' -CAC CCT GTC GGA AGA GGC CTT CTT C-3 ' 
(SEQ ID NO: 152) 

10 

1389-22: 

5' -GAA GAA GGC CTC TTC CGA CAG GGT G-3 ' (1389-22) 
(SEQ ID NO: 153) 

15 The following primer pairs were used to 

change the codon for cysteine residue 277 to serine of 
the murine OPG gene, resulting in the production of a 
muOPG [22-401] C277S protein: 

20 1389-23: 

5' -TGA CCT CTC GGA AAG CAG CGT GCA-3 ' 
(SEQ ID NO: 154) 

1389-24: 

25 5' -TGG ACG CTG CTT TCC GAG AGG TCA-3 ' 

(SEQ ID NO: 155) 

The following primer pairs were used to 
change the codon for cysteine residue 319 to serine of 
30 the murine OPG gene, resulting in the production of a 
muOPG [22-401] C319S protein: 



1389-17: 

5' -CCT CGA AAT CGA GCG AGC AGC TCC-3 ' 
35 (SEQ ID NO: 156) 
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1389-18: 

5 ' -CGA TTT CGA GGT CTT TCT CGT TCT C-3 ' 
(SEQ ID NO:157) 

5 

The following primer pairs were used to 
change the codon for cysteine residue 400 to serine of 
the murine OPG gene, resulting in the production of a 
muOPG [22-401] C400S protein: 

0 

1406-72: 

5' -CCG TGA AAA TAA GCT CGT TAT AAC TAG GAA TGG-3 ' 
(SEQ ID NO: 158) 



15 1406-75: 

5 ' -CCA TTC CTA GTT ATA ACG AGC TTA TTT TCA CGG-3 ' 
(SEQ ID NO: 159) 



Each resulting muOPG [22-401] plasmid 

20 containing the appropriate mutation was then 

transfected into human 293 cells, the mutant OPG-Fc 
fusion protein purified from conditioned media as 
described above. The biological activity of each 
protein was assessed the in vitro osteoclast forming 

25 assay described in example 11, Conditioned media from 
each trans fee tant was analysed by non-reducing SDS-PAGE 
and western blotting with anti-OPG antibodies. 

Mutation of any of the five C-terminal 
cysteine residues results in the production of 

30 predominantly (>90%) monomeric 55 led OPG molecules. 
This strongly suggests that the C-terminal cysteine 
residues together play a role in OPG homodimerization. 

C-terminal OPG deletion mutants were 
constructed to map the region (s) of the OPG C-terminal 

35 domain which are important for OPG homodimerization. 

These OPG mutants were constructed by PCR amplification 



- 114 - 



using primers which introduce premature stop 
translation signals in the C-terminal region of murine 
OPG. The 5' oligo was designed to the MuOPG start 
codon (containing a Hindlll restriction site) and the 
5 3' oligonucleotides (containing a stop codon and Xhol 
site) were designed to truncate the C-terminal region 
of muOPG ending at either threonine residue 200 (CT 
200), proline 212 (CT212), glutamic acid 293 (CT-293), 
or serine 355 (CT-355) . 
10 The following primers were used to construct 

muOPG [22-200] : 

1091-39: 

5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA 
15 ACA AG-3' (SEQ ID NO; 160) 

1391-91: 

5' -CCT CTC TCG AGT CAG GTG ACA TCT ATT CCA CAC 
TTT TGC GTG GC-3 ' (13 91-91) (SEQ ID NO: 161) 

20 

The following primers were used* to construct muOPG 
[22-212] : 

1091-39: 

25 5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA 

ACA AG-3' (SEQ ID NO: 162) 
1391-90: 

5' -CCT CTC TCG AGT CAA GGA ACA GCA AAC CTG AAG 
AAG GC -3' (SEQ ID NO: 163) 

30 

The following primers were used to construct muOPG 
[22-293] : 

1091-39: 

3 5 5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA 

ACA AG-3' (SEQ ID NO:'l64) 
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1391-89: 

5 ' - CCT CTC TCG AGT CAC TCT GTG GTG AGG TTC GAG 
TGG CC-3' (SEQ ID NO: 165) 

The following primers were used to construct muOPG 
[22-355] : 

1091-39: 

5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA 
ACA AG-3' (SEQ ID NO: 166) 

1391-88: 

5' CCT CTC TCG AGT CAG GAT GTT TTC AAG TGC TTG AGG GC- 
3 ' 

(SEQ ID NO: 167) 

Each resulting muOPG-CT plasmid containing 
the appropriate truncation was then transfected into 
human 293 cells, the mutant OPG-Fc fusion protein 
purified from conditioned media as described above. 
The biological activity of each protein was assessed 
the in vitro osteoclast forming assay described in 
example 11. The conditioned medias were also analysed 
by non-reducing SDS-PAGE and western blotting using 
anti-OPG antibodies. 

Truncation of the C-terminal region of OPG 
effects the ability of OPG to form homodimers . CT 355 
is predominantly monomeric, although some dimer is 
formed. CT 293 forms what appears to be equal molar 
amounts of monomer and dimer, and also high molecular 
weight aggregates. However, CT 212 and CT 200 are 
monomeric . 
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EXAMPLE 10 
Purification of OPG 

A- Purification of mammalian QPG-Fc Fusion Proteins 
5 5 L of conditioned media from 293 cells 

expressing an OPG-Fc fusion protein were prepared as 
follows, A frozen sample of cells was thawed into 10 
ml of 293S media (DMEM-high glucose, Ix L-glutamine, ' 
10% heat inactivated fetal bovine serum (FBS) and 100 

10 ug/ml hygromycin) and fed with fresh media after one 

day. After three days, cells were split into two T175 
flasks at 1:10 and 1:20 dilutions. Two additional 1:10 
splits were done to scale up to 200 T175 flasks. Cells 
were at 5 days post-thawing at this point. Cells were 

15 grown to near confluency (about three days) at which 
time serum-containing media was aspirated, cells were 
washed one time with 25 ml PBS per flask and 25 ml of 
SF media (DMEM-high glucose, Ix L-glutamine) was added 
to each flask. Cells were maintained at 5% C02 for 

2 0 three days at which point the media was harvested, 

centrifuged, and filtered through 0.45m cellulose 
nitrate filters (Corning) . 

OPG-Fc fusion proteins were purified using a 
Protein G Sepharose column (Pharmacia) equilibrated in 
25 PBS. The column size varied depending on volume of 
starting media. Conditioned media prepared as 
described above was loaded onto the colximn, the column 
washed with PBS, and pure protein eluted using lOOmM 
glycine pH 2.7. Fractions were collected into tubes 

3 0 containing IM Tris pH 9.2 in order to neutralize as 

quickly as possible. Protein containing fractions were 
pooled, concentrated in either an Amicon Centricon 10 
or Centriprep 10 and diafiltered into PBS. The pure 
protein is stored at -80^C. 
35 Murine [22-401] -Fc, Murine [22-180] -Fc, 

Murine [22-194] -Fc, human [22-401] -Fc and human [22- 
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2 01] Fc were purified by this procedure. Murine [22- 
185] -Fc is purified by tliis procedure. 

B. Preparation of anti-OPG antibodies 
5 Three New Zealand White rabbits (5-8 lbs 

initial wt) were injected subcutaneous ly with muOPG[22- 
401] -Fc fusion protein. Each rabbit was immunized on 
day 1 with 50 |Xg of antigen emulsified in an equal 
volume of Freunds complete adjuvant. Further boosts 

10 (Days 14 and 28) were performed by the same procedure 
with the substitution of Freunds incomplete adjuvant. 
Antibody titers were monitored by EIA. After the second 
boost, the antisera revealed high antibody titers and 
25ml production bleeds were obtained from each animal, 

15 The sera was first passed over an affinity column to 
which murine OPG-Fc had be immobilized. The anti-OPG 
antibodies were eluted with Pierce Gentle Elution 
Buffer containing 1% glacral acetic acid. The eluted 
protein was then dialyzed into PBS and passed over a Fc 

20 column to remove any antibodies specific for the Fc 
portion of the OPG fusion protein. The run through 
fractions containing anti-OPG specific antibodies were 
dialyzed into PBS, 

25 C. Purification of murine OPG [22-4011 

Antibodv Affinity Chromatography 

Affinity purified anti-OPG antibodies were 
diafiltered into coupling buffer (O.lM sodium carbonate 

3 0 pH 8.3, 0,5M NaCl) , and mixed with CNBr- activated 
sepharose beads (Pharmacia) for two hours at room 
temperature. The resin was then washed with coupling 
buffer extensively before bloclcing unoccupied sited 
with IM ethanolamine (pH 8.0) for two hours at room 

35 temperature. The resin was then washed with low pH 

(O.lM sodium acetate pH 4.0, 0 . 5M NaCl) followed by a 
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high pH wash (O.IM Tris-HCl pH 8.0, 0 . 5M NaCl) . The 
last washes were repeated three times. The resin was 
finally equilibrated with PBS before packing into a 
column. Once packed, the resin was washed with PBS. A 
5 blank elution was performed with O.IM glycine-HCl, 
pH 2.5), followed by re-equilibration with PBS. 

Concentrated conditioned media from CHO cells 
expressing muOPG [22-410] was applied to the column at a 
low flow rate. The column was washed with PBS until UV 

10 absorbance measured at 280nm returned to baseline. The 
protein was eluted from the column first with O.IM 
glycine-HCl (pH 2.5), re-equilibrated with PBS, and 
eluted with a second buffer (O.IM CAPS, pH 10.5), IM 
NaCl) . The two elution pools were diafiltered 

15 separately into PBS and sterile filtered before 
freezing at -20°C. 

Conventional Chroma toaraohv 

CHO cell conditioned media was concentrated 

20 23x in an Amicon spiral wound cartridge (SlOYlO) and 
diafiltered into 20mM tris pH 8,0* The diafiltered 
media was then applied to a Q-sepharose HP (Pharmacia) 
coluinn which had been equilibrated with 20mM tris pH 
8.0. The column was then washed until absorbence at 

25 2 80nm reached baseline. Protein was eluted with a 2 0 
column voliime gradient of 0-300mM NaCl in tris pH 8.0. 
OPG protein was detected using a western blot of column 
fractions , 

Fractions containing OPG were pooled and 
30 brought to a final concentration of 300mM NaCl, 0.2mM 
DTT. A NiNTA superose (Qiagen) coliimn was equilibrated 
with 20mM tris pH 8.0, 300mM NaCl, 0,2mM DTT after 
which the pooled fractions were applied. The column was 
washed with equilibration buffer until baseline 
3 5 absorbence was reached. Proteins were eluted from the 
column with a 0-3 OmM Imidazole gradient in 
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equilibration buffer. Remaining proteins were washed 
off the coliimn with IM Imidazole, Again a western blot 
was used to detect OPG containing fractions. 

Pooled fractions from the NiNTA column were 
5 dialyzed into 10mm potassiijm phosphate pH 7,0, 0 . 2mM 
DTT, The dialyzed pool was then applied to a ceramic 
hydroxyapatite column (Bio-Rad) which had been 
equilibrated in lOmM phosphate buffer. After column 
washing, the protein was eluted with a 10-lOOmM 

10 potassium phosphate gradient over 20 column volumes. 
This was then followed by a 20 column volume gradient 
of 100-400 mM phosphate. 

OPG was detected by coomassie blue staining 
of SDS-polyacrylamide gels and by western blotting. 

15 Fractions were pooled and diafiltered onto PBS and 

frozen at -80 ^C. The purified protein runs as a monomer 
and will remain so after diaf iltration into PBS. The 
monomer is stable when stored frozen or at pH 5 at 4°C. 
However if stored at 4^0 in PBS, dimers and what 

2 0 appears to be trimers and tetramers will form after one 

week, 

D. Purification of human OPG met [22-4011 from E. coli 
25 The bacterial cell paste was suspended into 

10 mM EDTA to a concentration of 15% (w/v) using a low 
shear homogenizer at 5^0, The cells were then 
disrupted by two homogenizations at 15,000 psi each at 
5^C. ^ The resulting homogenate was centrifuged at 5,000 

3 0 X g for one hour at 5°C. The centrifugal pellet was 

washed by low shear homogenization into water at the 
original homogenization volume followed by 
centrifugation as before. The washed pellet was then 
solubilized to 15% (w/v) by a solution of (final 
35 concentration) 5 M guanidine HCl, 10 mM dithiothreitol, 
10 mM TrisHCl, pH 8.5 at ambient temperature for 3 0 
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minutes. This solution was diluted 30-fold into 2M 
urea containing 50 mM CAPS, pH 10,5, 1 mM reduced 
glutathione and then stirred for 72 hours at 5^C, The 
OPG was purified from this solution at 25^C by first 
5 adjustment to pH 4.5 with acetic acid and then 

chromatography over a coliimn of SP-HP Sepharose resin 
equilibrated with 25 mM sodium acetate, pH 4.5. The 
column elution was carried out with a linear sodium 
chloride gradient from 50mMto550mMin the same 

10 buffer using 20 column volumes at a flow rate of 0.1 
column volumes /minute. The peak fractions containing 
only the desired OPG form were pooled and stored at 5^C 
or buffer exchanged into phosphate buffered saline, 
concentrated by ultrafiltration, and then stored at 

15 5^C. This material was analyzed by reverse phase HPLC, 
SDS-PAGE, limulus amebocyte lysate assay for the 
presence of endotoxin, and N- terminal sequencing. In 
addition, techniques such as mass spectrometry, 
pH/ temperature stability, fluoresence, circular 

2 0 dichroism, differential scanning calorimetiry , and 

protease profiling assays may also be used to examine 
the folded nature of the protein. 

25 EXAMPLE 11 

Biological Activity of Recombinant OPG 

Based on histology and histomorphometry, it 
appeared that hepatic overexpression of OPG in 

3 0 transgenic mice markedly decreased the numbers of 

osteoclasts leading to a marked increase in bone tissue 
(see Example 4) . To gain further insight into 
potential mechanism(s) underlying this in vivo effect, 
various forms of recombinant OPG have been tested in an 
3 5 in vitro culture model of osteoclast formation 

(osteoclast forming assay) . This culture system was 
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originally devised by Udagawa (Udagawa et al . 
Endocrinology 125, 1805-1813 (1989), Proc . Natl, Acad. 
Sci. USA 87, 7260-7264 (1990)) and employs a 
combination of bone marrow cells and cells from bone 
5 marrow stromal cell lines. A description of the 
modification of this culture system used for these 
studies has been previously published (Lacey et al. 
Endocrinology 136, 2367-2376 (1995)). In this method, 
bone marrow cells, flushed from the femurs and tibiae 

10 of mice, are cultured overnight in culture media (alpha 
MEM with 10% heat inactivated fetal bovine serum) 
supplemented with 500 U/ml CSF-1 (colony stimulating 
factor 1, also called M-CSF) , a hematopoietic growth 
factor specific for cells of the monocyte /macrophage 

15 family lineage. Following this incubation, the non- 
adherent cells are collected, subjected to gradient 
purification, and then cocultured with cells from the 
bone marrow cell line ST2 (1 x 10^ non-adherent cells : 
1 X 105 ST2 cells/ ml media) The media is 

2 0 supplemented with dexamethasone (100 nM) and the 

biologically-active metabolite of vitamin D3 known as 
1,25 dihydroxyvitamin D3 (1,25 {0H)2 D3 , 10 nM) , To 
enhance osteoclast appearance, prostaglandin E2 (250 
nM) is added to some cultures. The coculture period 
25 usually ranges from 8-10 days and the media, with all 
of the supplements freshly added, is renewed every 3-4 
days. At various intervals, the cultures are assessed 
for the presence of tartrate acid phosphatase (TRAP) 
using either a histochemical stain (Sigma Kit # 387A, 

3 0 Sigma, St. Louis, MO) or TRAP solution assay. The TRAP 

histochemical method allows for the identification of 
osteoclasts pheno typically which are multinucleated (^ 
3 nuclei) cells that are also TRAP+. The solution 
assay involves lysing the osteoclast-containing 
3 5 cultures in a citrate buffer (100 mM, pH 5.0) 

containing 0.1% Triton X-100. Tartrate resistant acid 
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phosphatase activity is then measured based on the 
conversion of p-nitrophenylphosphate (20 nM) to 
p-nitrophenol in the presence of 80 mM sodium tartrate 
which occurs during a 3-5 minute incubation at RT. The 
5 reaction is terminated by the addition of NaOH to a 
final concentration of 0.5 M. The optical density at 
405 nm is measured and the results are plotted. 

Previous studies (Udagawa et al. ibid ) using 
the osteoclast forming assay have demonstrated that 

10 these cells express receptors for ^^Sj^^alcitonin 

(autoradiography) and can make pits on bone surfaces, 
which when combined with TRAP positivity confirm that 
the multinucleated cells have an osteoclast phenotype. 
Additional evidence in support of the osteoclast 

15 phenotype of the multinucleated cells that arise in 
vitro in the osteoclast forming assay are that the 
cells express av and ^3 integrins by 

immunocytochemistry and calcitonin receptor and TRAP 
mRNA by in situ hybridization (ISH) . 

20 The huOPG [22-401] -Fc fusion was purified 

from CHO cell conditioned media and subsequently 
utilized in the osteoclast forming assay. At 100 ng/ml 
of huOPG [22-401] -Fc, osteoclast formation was 
virtually 100% inhibited (Figure 19A) . The levels of 

25 TRAP measured in lysed cultures in microti tre plate 

wells were also inhibited in the presence of OPG with 
an ID50 of approximately 3 ng/ml (Figure 20) . The 
level of TRAP activity in lysates appeared to correlate 
with the relative number of osteoclasts seen by TRAP 

3 0 cytochemistry (compare Figures 19A-19G and 20) , 

Purified human IgGl and TNFbp were also tested in this 
model and were found to have no inhibitory or 
stimulatory effects suggesting that the inhibitory 
effects of the huOPG [22-401] -Fc were due to the OPG 

3 5 portion of the fusion protein. Additional forms of the 
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human and murine molecules have been tested and the 
cumulative data are summarized in Table 1. 

5 Table 1 

Effects of various OPG forms on in vitro 
osteoclast formation 

10 OPG Construct Relative Bioactivitv in vitro 
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+++, ED^, = 0.4-2 ng/ml 

++, EDgo = 2-10 ng/ml 

+ , ED^^ = 10-100 ng/ml 

5 ED^^, > 100 ng/ml 



The cumulative data suggest that murine and 
human OPG amino acid sequences 22-401 are fully active 
in vitro , when either fused to the Fc domain, or 

10 unfused. They inhibit in a dose-dependent manner and 
possess half-maximal activities in the 2-10 ng/ml 
range. Truncation of the murine C-terminus at 
threonine residue 180 inactivates the molecule, whereas 
truncations at cysteine 185 and beyond have full 

15 activity. The cysteine residue located at position 185 
is predicted to form an SS3 bond in the domain 4 region 
of OPG. Removal of this residue in other TNFR-related 
proteins has previously been shown to abrogate 
biological activity (Yan et al. J. Biol. Chem. 266 , 

20 12099-12104 (1994)}. Our finding that muOPG[22-180] -Fc 
is inactive while muOPG[22-185] -Fc is active is 
consistent with these findings. This suggests that 
amino acid residues 22-185 define a region for OPG 
activity. 

25 These findings indicate that like 

transgenically-expressed OPG, recombinant OPG protein 
also suppressed osteoclast formation as tested in the 
osteoclast forming assay. Time course experiments 
examining the appearance of TRAP+ cells, p3+ cells, 

3 0 F480H- cells in cultures continuously exposed to OPG 
demonstrate that OPG blocks the appearance TRAP+ and 
p3+ cells, but not F480+ cells. In contrast, TRAP+ and 
P3+ cells begin to appear as early as day 4 following 
culture establishment in control cultures. Only F480+ 

3 5 cells can be found in OPG- treated cultures and they 
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appear to be present at qualitatively the same numbers 
as the control cultures. Thus, the mechanism of OPG 
effects in vitro appears to involve a blockade in 
osteoclast differentiation at a step beyond the 
5 appearance of monocyte-macrophages but before the 
appearance of cells expressing either TRAP or p3 
integrins. Collectively these findings indicate that 
OPG does not interfere with the general growth and 
differentiation of monocyte-macrophage precursors from 

10 bone marrow, but rather suggests that OPG specifically 
blocks the selective differentiation of osteoclasts 
from monocyte-macrophage precursors. 

To determine more specifically when in the 
osteoclast differentiation pathway that OPG was 

15 inhibitory, a variation of the in vitro culture method 
was employed. This variation, described in (Lacey et 
al , supra ) , employs bone marrow macrophages as 
osteoclast precursors. The osteoclast precursors are 
derived by taking the nonadherent bone marrow cells 

20 after -an overnight incubation in CSF-l/M-CSF, and 

culturing the cells for an additional 4 days with 1,000 
- 2,000 U/ml CSF-1. Following 4 days of culture, 
termed the growth phase, the non-adherent cells are 
removed. The adherent cells, which are bone marrow 

25 macrophages, can then be exposed for up to 2 days to 
various treatments in the presence of 1,000 - 2,000 
U/ml CSF-1. This 2 day period is called the 
intermediate differentiation period. Thereafter, the 
cell layers are again rinsed and then ST-2 cells (1 X 

30 10^ cell/ml), dexamethasone (100 nM) and 1,25 (0H)2 D3 
(10 nM) are added for the last 8 days for what is 
termed the terminal differentiation period. Test 
agents can be added during this terminal period as 
well. Acquisition of phenotypic markers of osteoclast 

3 5 differentiation are acquired during this terminal 
period (Lacey et al. ibid ) . 
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huOPG [22-401] -Fc (100 ng/ml) was tested for 
its effects on osteoclast formation in this model by 
adding it during either the intermediate, terminal or, 
alternatively, both differentiation periods. Both TRAP 
5 cytochemistry and solution assays were performed. The 
results of the solution assay are shown in Figure 21. 
HuOPG [22-401] -Fc inhibited the appearance of TRAP 
activity when added to both the intermediate and 
terminal or only the terminal differentiation phases. 

10 When added to the intermediate phase and then removed 

from the cultures by rinsing, huOPG [22-401] -Fc did not 
bloclc the appearance of TRAP activity in culture 
lysates. The cytochemistry results parallel the 
solution assay data. Collectively, these observations 

15 indicate that huOPG [22-401] -Fc only needs to be 

present during the terminal differentiation period for 
it to exert its all of its suppressive effects on 
osteoclast f oarmation. 

B. In viv o ILl-g and ILl-B challenge experiments 

2 0 ILI increases bone resorption both 

systemically and locally when injected subcutaneous ly 
over the calvaria of mice (Boyce et al., Endocrinology 
125, 1142-1150 (1989)). The systemic effects can be 
assessed by the degree of hypercalcemia and the local 
25 effects histologically by assessing the relative 

magnitude of the osteoclast-mediated response. The aim 
of these experiments was to determine if recombinant 
muOPG [22-401] -Fc could modify the local and/or 
systemic actions of ILI when injected subcutaneously 

3 0 over the same region of the calvaria as ILI. 

IL-1 P experiment 

Male mice (ICR Swiss white) aged 4 weelcs were 
divided into the following treatment groups (5 mice per 
3 5 group) : Control group: ILI treated animals (mice 

received 1 injection/day of 2.5 ug of IL1-|3) ; Low dose 
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muOPG [22-401] -Fc treated animals (mice received 3 
injections /day of 1 jig of muOPG [22-401] -Fc) ; Low dose 
muopg [22-401] -Fc and IL1-|3; High dose muOPG [22-401]- 
Fc treated animals (mice receive 3 injections/day of 10 
|ig muOPG [22-401] -Fc) ; Higli dose muOPG [22-401] -Fc and 
ILl-p, All mice received the same total number of 
injections of eitlier active factor or veliicle (0.1% 
bovine sertim albumin in phospliate buffered saline) . 
All groups are sacrificed on the day after the last 
injection. The weights and blood ionized calcium 
levels are measured before the first injections, four 
hours after the second injection and 24 hours after the 
third ILl injection, just before the animals were 
sacrificed. After sacrifice the calvaria were removed 
and processed for paraffin sectioning. 

ILl-g experiment 

Male mice (ICR Swiss white) aged 4 weelcs were 
divided into the following treatment groups (5 mice per 
group) : Control group; ILl alpha treated animals (mice 
received 1 injection/day of 5 ug of ILl -alpha ) ; Low 
dose muOPG [22-401] -Fc treated animals (mice received 1 
injection/day of 10 jig of muOPG [22-401] -Fc; Low dose 
muopg [22-401] -Fc and ILl-alpha, (dosing as above); 
High dose muopg [22-401] -Fc treated animals (mice 
received 3 injections /day of 10 |Lig muOPG [22-401] -Fc; 
High dose muOPG [22-401] -Fc and ILl-a, All mice 
received the same niimber of injections /day of either 
active factor or vehicle. All groups were sacrificed 
on the day after the last injection. The blood ionized 
calcium levels were measured before the first 
injection, four hours after the second injection and 24 
hours after the third ILl injection, just before the 
animals were sacrificed. The animal weights were 
measured before the first injection, four hours after 
the second injection and 24 hours after the third ILl 
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injection, just before the animals were sacrificed. 
After sacrifice the calvaria were removed and processed 
for paraffin sectioning. 

5 

Histological methods 

Calvarial bone samples were fixed in zinc 
formalin, decalcified in formic acid, dehydrated 
through ethanol and mounted in paraffin. Sections {5{im 

10 thick) were cut through the calvaria adjacent to the 

lambdoid suture and stained with either hematoxylin and 
eosin or reacted for tartrate resistant acid 
phosphatase activity (Sigma Kit# 387A) and 
counterstained with hematoxylin. Bone resorption was 

15 assessed in the ILl-a treated mice by histomorphometric 
methods using the Osteomeasure (Osteometries, Atlanta, 
GA) by tracing histologic features onto a digitizer 
platen using a microscope-mounted camera lucida 
attachment. Osteoclast numbers, osteoclast lined 

20 surfaces, and eroded surfaces were determined in the 

marrow spaces of the calvarial bone. The injected and 
non-injected sides of the calvaria were measured 
separately. 

25 Results 

ILl-a and ILl-P produced hypercalcemia at the 
doses used, particularly on the second day, presiimably 
by the induction of increased bone resorption 
systemically . The hypercalcemic response was blocked 
30 by muOPG [22-401] -Fc in the ILl-beta treated mice and 
significantly diminished in mice treated with ILl- 
alpha, an effect most apparent on day 2 (Figure 22A- 
22B) . 

Histologic analysis of the calvariae of mice 
3 5 treated with ILl -alpha and beta shows that ILl 

treatments alone produce a marked increase in the 
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indices of bone resorption including: osteoclast 
nximber, osteoclast lined surface, and eroded surface 
(surfaces showing deep scalloping due to osteoclastic 
action (Figure 23B, Table 2) . In response to ILl-a or 
5 ILl-p, the increases in bone resorption were similar on 
the injected and non-injected sides of the calvaria. 
Muopg [22-401] -Fc injections reduced bone resorption in 
both ILl -alpha and beta treated mice and in mice 
receiving vehicle alone but this reduction was seen 
10 only on the muopg [22-401] -Fc injected sides of the 
calvariae. 

The most lilcely explanation for these 
observations is that muOPG [22-401] -Fc inhibited bone 
resorption, a conclusion supported by the reduction of 

15 both the total osteoclast number and the percentage of 
available bone surface undergoing bone resorption, in 
the region of the calvaria adjacent to the muOPG [22- 
401] -Fc injection sites. The actions of muOPG [22- 
401] -Fc appeared to be most marlced locally by 

20 histology, but the fact that muOPG [22-401] -Fc also 

blunted ILl -induced hypercalcemia suggests that muOPG 
[22-401] -Fc has more subtle effects on bone resorption 
systemically . 
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C. Systemic Effects of muQPG [22-4011 -Fc in Growing 
Mice 

Male BDFl mice aged 3-4 weeks, weight range 
5 9.2- 15. 7g were divided into groups of ten mice per 
group. These mice were injected subcutaneous ly with 
saline or muOPG [22-401] -Fc 2.5mg/kg bid for 14 days 
(5mg/kg/day) , The mice were radiographed before 
treatment, at day 7 and on day 14. The mice were 

10 sacrificed 24 hours after the final injection. The 
right femur was removed, fixed in zinc formalin, 
decalcified in formic acid and embedded in paraffin. 
Sections were cut through the mid region of the distal 
femoral metaphysis and the femoral shaft. Bone 

15 density, by histomorphometry, was determined in six 

adjacent regions extending from the metaphyseal limit 
of the growth plate, through the primary and secondary 
spongiosa and into the femoral diaphysis (shaft) . Each 
region was 0.5 X 0.5 mm^ . 

20 

Radiographic changes 

After seven days of treatment there was 
evidence of a zone of increased bone density in the 
spongiosa associated with the growth plates in the OPG 

25 treated mice relative to that seen in the controls. 
The effects were particularly striking in the distal 
femoral and the proximal tibial metaphases (Figure 24A- 
24B) . However bands of increased density were also 
apparent in the vertebral bodies, the iliac crest and 

3 0 the distal tibia. At 14 days, the regions of opacity 

had extended further into the femoral and tibial shafts 
though the intensity of the radio-opacity was 
diminished. Additionally, there were no differences in 
the length of the femurs at the completion of the 

35 experiment or in the change in length over the duration 
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of the experiment implying that OPG does not alter bone 
growth . 

Histological Changes 
5 The distal femoral metaphysis showed 

increased bone density in a regions 1.1 to 2.65 mm in 
distance from the growth plate (Figures 25 and 26A- 
26B) . This is a region where bone is rapidly removed 
by osteoclast-mediated bone resorption in mice. In 
10 these rapidly growing young mice, the increase in bone 
in this region observed with OPG treatment is 
consistent with an inhibition of bone resorption. 

D. Effects of Osteoprotegerin on Bone Loss Induced bv 
15 Ovariectomv in the Rat 

Twelve week old female Fisher rats were 
ovariectomized (OVX) or sham operated and dual xray ^ 
absorptiometry (DEXA) measurements made of the bone 

20 density in the distal femoral metaphysis. After 3 
days recovery period, the animals received daily 
injections for 14 days as follows: Ten sham operated 
animals received vehicle (phosphate buffered saline) ; 
Ten OVX animals received vehicle (phosphate buffered 

25 saline) ; Six OVX animals received OPG-Fc 5mg/kg SC; 
Six OVX animals received pamidronate (PAM) 5mg/kg SC; 
Six OVX animals received estrogen (ESTR) 40ug/kg SC. 
After 7 and 14 days treatment the animals had bone 
density measured by DEXA. Two days after the last 

3 0 injection the animals were killed and the right tibia 
and femur removed for histological evaluation. 

The DEXA measurements of bone density showed a 
trend to reduction in the bone density following 
35 ovariectomy that was blocked by OPG-Fc. Its effects 
were similar to the known antiresorptive agents 
estrogen and pamidronate, (Figure 27) . The 
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histomorphometric analysis confirmed these observations 
with OPG-Fc treatment producing a bone density that was 
significantly higher in OVX rats than that seen in 
untreated OVX rats (Figure 28) . These results confirm 
5 the activity of OPG in the bone loss associated with 
withdrawal of endogenous estrogen following 
ovariectomy. 



10 In vivo Summary 

The in vivo actions of recombinant OPG 
parallel the changes seen in OPG transgenic mice. The 
reduction in osteoclast number seen in the OPG 
transgenic is reproduced by injecting recombinant OPG 

15 locally over the calvaria in both normal mice and in 
mice treated with ILl-a or ILl-p. The OPG transgenic 
mice develop an osteopetrotic phenotype with 
progressive filling of the marrow cavity with bone and 
unremodelled cartilage extending from the growth plates 

2 0 from day 1 onward after birth. In normal three week 

old (growing) mice, OPG treatments also led to 
retention of bone and unremodelled cartilage in regions 
of endochondral bone formation, an effect observed 
radiographically and confirmed histologically. Thus, 
25 recombinant OPG produces phenotypic changes in normal 

animals similar to those seen in the transgenic animals 
and the changes are consistent with OPG-induced 
inhibition of bone resorption. Based on in vitro assays 
of osteoclast formation, a significant portion of this 

3 0 inhibition is due to impaired osteoclast formation. 

Consistent with this hypothesis, OPG blocks 
ovariectomy- induced osteoporosis in rat. Bone loss in 
this model is known to be mediated by activated 
osteoclasts, suggesting a role for OPG in treatment of 
3 5 primary osteoporosis. 
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EXAMPLE 12 
Pegylation Derivatives of OPG 

5 

Preparation of N-terminal PEG-OPQ conjugates bv 
reductive alkvlation 

HuOPG met [22-194] P25A was buffer exchanged into 
25-50 mM NaOAc, pH 4.5-4.8 and concentrated to 2-5 

10 mg/ml. This solution was used to conduct OPG reductive 
alkylation with monof unctional PEG aldehydes at 5-7 ""C. 
PEG mono functional aldehydes, linear or branched, MW=1 
to 57 kDa (available from Shearwater Polymers) were 
added to the OPG solution as solids in amounts 

15 constituting 2-4 moles of PEG aldehyde per mole of OPG. 
After dissolution of polymer into the protein solution, 
sodium cyanoborohydride was added to give a final 
concentration of 15 to 2 0 mM in the reaction mixture 
from 1-1.6 M freshly prepared stock solution in cold DI 

2 0 water. The progress of the reaction and the extent of 

OPG PEGylation was monitored by size- exclusion HPLC on 
a G3000SWXL column (Toso Haas) eluting with 100 mM 
NaP04, 0.5 M NaCl, 10% ethanol, pH 6.9. Typically the 
reaction was allowed to proceed for 16-18 hours, after 
25 which the reaction mixture was diluted 5-8 times and 
the pH lowered to 3,5-4, The reaction mixture was 
fractionated by ion exchange chromatography (HP SP 
HiLoad 16/10, Pharmacia) eluting with 20 mM NaOAc pH 4 
with a linear gradient to 0.75M NaCl over 25 column 

3 0 volumes at a flow rate of 30 cm/h. Fractions of mono-, 

di- or poly- PEGylated OPG were pooled and 
characterized by SEC HPLC and SDS-PAGE. By N-terminal 
sequencing, it was determined that the monoPEG-OPG 
conjugate, the major reaction product in most cases, 
35 was 98% N-terminally PEG-modified OPG. 
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This procedure was generally used to prepare the 
following N-terminal PEG-OPG conjugates (where OPG is 
HuOPG met [22-194] P25A: 5 kD monoPEG, 10 kD mono 
branched PEG, 12 kD mono PEG, 20 kD monoPEG, 20 kD mono 
5 branched PEG, 25 kD monoPEG, 31 kD monoPEG, 57 kD 

monoPEG, 12 kD diPEG, 25 kD diPEG, 31 kD diPEG, 57 kD 
diPEG, 25 kD triPEG. 

Preparation of PEG-OPG conjugates bv acvlation 

10 HuOPG met [22-194] P25A was buffer exchanged into 

50 mM BICINE buffer, pH 8 and concentrated to 2-3 
mg/ml. This solution was used to conduct OPG acylation 
with monofunctional PEG N-hydroxysuccinimidyl esters at 
room temperature, PEG N-hydroxysuccinimidyl esters, 

15 linear or branched, MW=1 to 57 kDa (available from 

Shearwater Polymers) were added to the OPG solution as 
solids in amounts constituting 4-8 moles of PEG N- 
hydroxysuccinimidyl ester per mole of OPG, The progress 
of the reaction and the extent of OPG PEGylation was 

20 monitored by size exclusion HPLC on a G3 000SWxL column 
(Toso Haas) eluting with 100 mM NaP04, 0.5 M NaCl, 10% 
ethanol, pH 6.9. Typically the reaction was allowed to 
proceed for 1 hour, after which the reaction mixture 
was diluted 6-8 times and the pH lowered to 3,5-4. The 

25 reaction mixture was fractionated by ion exchange 

chromatography (HP SP HiLoad 16/10, Pharmacia) eluting 
with 20 mM NaOAc pH 4 with a linear gradient to 0.75M 
NaCl over 25 column volumes at a flow rate of 3 0 cm/h. 
Fractions of mono-, di- or poly- PEGylated OPG were 

3 0 pooled and characterized by SEC HPLC and SDS-PAGE. 

This procedure was generally used to prepare the • 
following PEG-OPG conjugates: 5 kD polyPEG, 20 IcD 
poly PEG, 40 kD poly branched PEG, 50 kD poly PEG. 
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Prepa ration of dimeric PEG-QPG 

HuOPG met [22-194] P25A is prepared for 
thiolation at 1-3 mg/ml in a phosphate buffer at near 
neutral pH. S-acetyl mecaptosuccinic anhydride (AMSA) 
is added in a 3-7 fold molar excess while maintaining 
pH at 7,0 and the rxn stirred at 4_C for 2 hrs • The 
monothiolated-OPG is separated from unmodified and 
polythiolated OPG by ion exchange chromatography and 
the protected thiol deprotected by treatment with 
hydroxylamine. After deprotection, the hydroxylamine 
is removed by gel filtration and the resultant 
monothiolated-OPG is subjected to a variety of thiol 
specific crosslinking chemistries. To generate a 
disulfide bonded dimer, the thiolated OPG at >lmg/ml is 
allowed to undergo air oxidation by dialysis in 
slightly basic phosphate buffer. The covalent 
thioether OPG dimer was prepared by reacting the bis- 
maleimide crosslinker, N, N-bis (3-maleimido propianyl)- 
2 -hydroxy 1,3 propane with the thiolated OPG at >lmg/ml 
at a 0.6x molar ratio of crosslinker : OPG in phosphate 
buffer at pH 6.5. Similarly, the PEG dumbbells are 
produced by reaction of subs toichiome trie amounts of 
bis-maleimide PEG crosslinkers with thiolated OPG at 
>lmg/ml in phosphate buffer at pH 6.5. Any of the 
above dimeric conjugates may be further purified using 
either ion exchange or size exclusion chromatographies. 

Dimeric PEG-OPG conjugates (where OPG is HuOPG met 
[22-194] P25A prepared using the above procedures 
include disulf ide-bonded OPG dimer, covalent thioether 
OPG dimer with an aliphatic amine type crosslinker, 3.4 
kD and 8kD PEG dumbbells and monobells. 

PEG-OPG conjugates were tested for activity in 
vi^^^Q using the osteoclast maturation assay described 
in Example llA and for activity in vivo by measuring 
increased bone density after Injection into mice as 
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described in Example IIC. The in vivo activity is 
shown below in Table 3 . 

Table 3 

In vivo biological activity of Pegylated OPG 
QPG Construct 
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* ★ ★ 

While the invention has been described in 
what is considered to be its preferred embodiments, it 
5 is not to be limited to the disclosed embodiments, but 
on the contrary, is intended to cover various 
modifications and equivalents included within the 
spirit and scope of the appended claims, which scope is 
to be accorded the broadest interpretation so as to 
0 encompass all such modifications and equivalents. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT :Amgen Inc. 

(ii) TITLE OF INVENTION: OSTEOPROTEGERIN 
(iii) NUMBER OF SEQUENCES: 168 

15 (iv) CORRESPONDENCE ADDRESS; 

(A) ADDRESSEE: Amgen Inc. 

(B) STREET: 1840 Dehavilland Drive 

(C) CITY: Thousand Oaks 

(D) STATE: California 

20 (E) COUNTRY: United States 

(F) ZIP: 91320 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOSyMS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
3 0 (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY /AGENT INFORMATION: 
35 (A) NAME: Winter, Robert B. 

(C) REFERENCE /DOCKET NUMBER: A-378-CIP2 



(2) INFORMATION FOR SEQ ID N0:1: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
AAAGGAAGGA AAAAAGCGGC CGCTACANNN NNNNNT 3 6 

5 (2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



50 



(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

20 TCGACCCACG CGTCCG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: cDNA 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GGGTGCGCAG GC 12 
(2) INFORMATION FOR SEQ ID NO: 4: 

40 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
CB) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

TGTAAAACGA CGGCCAGT 18 
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(2) INFORMATION FOR SEQ ID NO; 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
CAGGAAACAG CTATGACC 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
CAATTAACCC TCACTAAAGG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ^ TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCATTATGAC CCAGAAACCG GAC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

10 

AGGTAGCGCC CTTCCTCACA TTC 23 
(2) INFORMATION FOR SEQ ID NO: 9: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

25 

GACTAGTCCC ACAATGAACA AGTGGCTGTG 30 
(2) INFORMATION FOR SEQ ID NO: 10: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) T0P0L(X3Y: linear 



35 



(ii) MOLECULE TYPE: cDNA 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATAAGAATGC GGCCGCTAAA CTATGAAACA GCCCAGTGAC CATTC 45 
(2) INFORMATION FOR SEQ ID NO: 11: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 '(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCTCTAGAA AGAGCTGGGA C 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGCCGTGTTC CATTTATGAG C 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATCAAAGGCA GGGCATACTT CCTG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GTTGCACTCC TGTTTCACGG TCTG 
(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CAAGACACCT TGAAGGGCCT GATG 2 4 

15 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
2 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TAACTTTTAC AGAAGAGCAT CAGC 24 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
45 AGCGCGGCCG CATGAACAAG TGGCTGTGCT GCG 33 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



10 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

AGCTCTAGAG AAACAGCCCA GTGACCATTC C 31 
(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GTGAAGCTGT GCAAGAACCT GATG 2 4 

(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ATCAAAGGCA GGGCATACTT CCTG 24 
(2) INFORMATION FOR SEQ ID NO: 21: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

5 CAGATCCTGA AGCTGCTCAG TTTG 24 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AGCGCGGCCG CGGGGACCAC AATGAACAAG TTG 33 
(2) INFORMATION FOR SEQ ID NO: 23: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: cDNA 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGCTCTAGAA TTGTGAGGAA ACAGCTCAAT GGC 33 
40 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ~ LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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ATAGCGGCCG CTGAGCCCAA ATCTTGTGAC AAAACTCAC 39 
(2) INFORMATION FOR SEQ ID NO: 25: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

TCTAGAGTCG ACTTATCATT TACCCGGAGA CAGGGAGAGG CTCTT 45 
(2) INFORMATION FOR SEQ ID NO: 26: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 3 8 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCTCTGCGGC CGCTAAGCAG CTTATTTTCA CGGATTGAAC CTG 
(2) INFORMATION FOR SEQ ID NO: 28: 



43 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
TCCGTAAGAA ACAGCCCAGT GACC 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CCTCTGCGGC CGCTGTTGCA TTTCCTTTCT G 
(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 19 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly His 
15 10 15 

15 Gin Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 32: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: CDNA 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TCCCTTGCCC TGACCACTCT T 21 
3 5 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
50 CCTCTGCGGC CGCACACACG TTGTCATGTG TTGC 
(2) INFORMATION FOR SEQ ID NO: 34: 



34 
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10 



15 



25 



30 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TCCCTTGCCC TGACCACTCT T 21 
(2) INFORMATION FOR SEQ ID NO: 35: 



{i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
CCTCTGCGGC CGCCTTTTGC GTGGCTTCTC TGTT 34 
(2) INFORMATION FOR SEQ ID NO: 36: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CCTCTGAGCT CAAGCTTGGT TTCCGGGGAC CACAATG 37 

50 (2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: cDNA 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CCTCTGCGGC CGCTAAGCAG CTTATTTTTA CTGAATGG 3 8 

(2) INFORMATION FOR SEQ ID NO: 38: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCTCTGAGCT CAAGCTTGGT TTCCGGGGAC CACAATG 3 7 

(2) INFORMATION FOR SEQ ID NO: 39: 



30 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



45 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CCTCTGCGGC CGCCAGGGTA ACATCTATTC CAC 33 
(2) INFORMATION FOR SEQ ID NO: 40: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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10 



20 



25 



35 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CCGAAGCTTC CACCATGAAC AAGTGGCTGT GCTGC 
C2) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCTCTGTCGA CTATTATAAG CAGCTTATTT TCACGGATTG 
(2) INFORMATION FOR SEQ ID NO: 42: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



40 



(ii) MOLECULE TYPE: cDNA 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 42 : 
TCCCTTGCCC TGACCACTCT T 
45 (2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
50 -(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

5 

CCTCTGTCGA CTTAACACAC GTTGTCATGT GTTGC 3 5 

(2) INFORMATION FOR SEQ ID NO: 44: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 
■ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: cDNA 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TCCCTTGCCC TGACCACTCT T 21 
25 (2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

3 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

40 CCTCTGTCGA CTTACTTTTG CGTGGCTTCT CTGTT 3 5 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



{ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 







GTGAAGAGCG 


TGAAGAGCGG 


TTCCTCCTTT 


CAGCAAAAAA 


CCCCTCAAGA 


CCCGTTTAGA 


60 




5 


GGCCCCAAGG 


GGTTATGCTA 


GTTATTGCTC 


AGCGGTGGCA 


GCAGCCAACT 


CAGCTTCCTT 


120 






TCGGGCTTTC 


TTCTTCTTCT 


TCTTCTTTCC 


GCGGATCCTC 


GAGTAAGCTT 


CCATGGTACC 


180 




10 


CTGCAGGTCG 


ACACTAGTGA 


GCTCGAATTC 


CAACGCGTTA 


ACCATATGTT 


ATTCCTCCTT 


240 




TAATTAGTTA 


AAACAAATCT 


AGAATCAAAT 


CGATTAATCG 


ACTATAACAA 


ACCATTTTCT 


300 






TGCGTAAACC 


TGTACGATCC 


TACAGGTACT 


TATGTTAAAC 


AATTGTATTT 


CAAGCGATAT 


360 




15 


AATAGTGTGA 


CAAAAATCCA 


ATTTATTAGA 


ATCAAATGTC 


AATCTATTAC 


CGTTTTAATG 


420 






ATATATAACA 


CGCAAAACTT 


GCGACAAACA 


ATAGGTAAGG 


ATAAAGAGAT 


GGGTATGAAA 


480 




20 


GACATAAATG 


CCGACGACAC 


TTACAGAATA 


ATTAATAAAA 


TTAAAGCCTG 


TAGAAGCAAT 


540 




AATGATATTA 


ATCAATGCTT 


ATCTGATATG 


ACTAAAATGG 


TACATTGTGA 


ATATTATTTA 


600 


i y 




CTCGCGATCA 


TTTATCCTCA 


TTCTATGGTT 


AAATCTGATA 


TTTCAATTCT 


GGATAATTAC 


660 


g 


25 


CCTAAAAAAT 


GGAGGCAATA 


TTATGATGAC 


GCTAATTTAA 


TAAAATATGA 


TCCTATAGTA 


720 






GATTATTCTA 


ACTCCAATCA 


TTCACCGATT 


AATTGGAATA 


TATTTGAAAA 


CAATGCTGTA 


780 




30 


AATAAAAAAT 


CTCCAAATGT 


AATTAAAGAA 


GCGAAATCAT 


CAGGTCTTAT 


CACTGGGTTT 


840 




AGTTTCCCTA 


TTCATACTGC 


TAATAATGGC 


TTCGGAATGC 


TTAGTTTTGC 


ACATTCAGAG 


900 






AAAGACAACT 


ATATAGATAG 


TTTATTTTTA 


CATGCGTGTA 


TGAACATACC 


ATTAATTGTT 


960 




35 


CCTTCTCTAG 


TTGATAATTA 


TCGAAAAATA 


AATATAGCAA 


ATAATAAATC 


AAACAACGAT 


1020 






TTAACCAAAA 


GAGAAAAAGA ATGTTTAGCG 


TGGGCATGCG 


AAGGAAAAAG 


CTCTTGGGAT 


1080 




40 


ATTTCAAAAA 


TATTAGGCTG 


TAGTAAGCGC 


ACGGTCACTT 


TCCATTTAAC 


CAATGCGCAA 


1140 




ATGAAACTCA 


ATACAACAAA 


CCGCTGCCAA 


AGTATTTCTA 


AAGCAATTTT 


AACAGGAGCA 


1200 






ATTGATTGCC 


CATACTTTAA 


AAGTTAAGTA 


CGACGTCCAT 


ATTTGAATGT 


ATTTAGAAAA 


1260 




45 


ATAAACAAAA 


GAGTTTGTAG 


AAACGCAAAA 


AGGCCATCCG 


TCAGGATGGC 


CTTCTGCTTA 


1320 






ATTTGATGCC 


TGGCAGTTTA 


TGGCGGGCGT 


CCTGCCCGCC 


ACCCTCCGGG 


CCGTTGCTTC 


1380 




50 


GCAACGTTCA 


AATCCGCTCC 


CGGCGGATTT 


GTCCTACTCA 


GGAGAGCGTT 


CACCGACAAA 


1440 




CAACAGATAA 


AACGAAAGGC 


CCAGTCTTTC 


GACTGAGCCT 


TTCGTTTTAT 


TTGATGCCTG 


1500 






GCAGTTCCCT 


ACTCTCGCAT 


GGGGAGACCA 


TGCATAC 






1537 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: cDNA 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GAGAAGTTTC TTCATCCA 
2 0 (2) INFORMATION FOR SEQ ID NO; 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

35 CGATTTGATT CTAGAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GGTAC 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQtJENCE CHARACTERISTICS: 
40 (A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CGAATTCCAA CGCGTTAACC ATATGTTATT CCTCCTTCTA GAATCAAAT 



49 
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(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

15 

GCGTAACGTA TGCATGGTCT CCCCATGCGA GAGTAGGGAA CTGCCAGGCA TCAAATAAAA 60 

CGAAAGGCTC AGTCGAAAGA CTGGGCCTTT CGTTTTATCT GTTGTTTGTC GGTGAACGCT 120 

20 CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG TTGCGAAGCA ACGGCCCGGA 180 

GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC AAATTAAGCA GAAGGCCATC 240 

CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTTTGT TTATTTTTCT AAATACATTC 300 

25 

AAATATGGAC GTCGTACTTA ACTTTTAAAG TATGGGCAAT CAATTGCTCC TGTTAAAATT 360 

GCTTTAGAAA TACTTTGGCA GCGGTTTGTT GTATTGAGTT TCATTTGCGC ATTGGTTAAA 420 

3 0 TGGAAAGTGA CCGTGCGCTT ACTACAGCCT AATATTTTTG AAATATCCCA AGAGCTTTTT 480 

CCTTCGCATG CCCACGCTAA ACATTCTTTT TCTCTTTTGG TTAAATCGTT GTTTGATTTA 540 

TTATTTGCTA TATTTATTTT TCGATAATTA TCAACTAGAG AAGGAACAAT TAATGGTATG 600 

35 

TTCATACACG CATGTAAAAA TAAACTATCT ATATAGTTGT CTTTCTCTGA ATGTGCAAAA 660 

CTAAGCATTC CGAAGCCATT ATTAGCAGTA TGAATAGGGA AACTAAACCC AGTGATAAGA 720 

40 CCTGATGATT TCGCTTCTTT AATTACATTT GGAGATTTTT TATTTACAGC ATTGTTTTCA 780 

AATATATTCC AATTAATCGG TGAATGATTG GAGTTAGAAT AATCTACTAT AGGATCATAT 840 

TTTATTAAAT TAGCGTCATC ATAATATTGC CTCCATTTTT TAGGGTAATT ATCCAGAATT 900 

45 

GAAATATCAG ATTTAACCAT AGAATGAGGA TAAATGATCG CGAGTAAATA ATATTCACAA 960 

TGTACCATTT TAGTCATATC AGATAAGCAT TGATTAATAT CATTATTGCT TCTACAGGCT 102 0 

50 TTAATTTTAT TAATTATTCT GTAAGTGTCG TCGGCATTTA TGTCTTTCAT ACCCATCTCT 1080 

TTATCCTTAC CTATTGTTTG TCGCAAGTTT TGCGTGTTAT ATATCATTAA AACGGTAATA 1140 
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10 



GATTGACATT TGATTCTAAT AAATTGGATT TTTGTCACAC TATTATATCG CTTGAAATAC 1200 

AATTGTTTAA CATAAGTACC TGTAGGATCG TACAGGTTTA CGCAAGAAAA TGGTTTGTTA 1260 

TAGTCGATTA ATCGATTTGA TTCTAGATTT GTTTTAACTA ATTAAAGGAG GAATAACATA 1320 

TGGTTAACGC GTTGGAATTC GAGCTCACTA GTGTCGACCT GCAGGGTACC ATGGAAGCTT 13 80 

ACTCGAGGAT CCGCGGAAAG AAGAAGAAGA AGAAGAAAGC CCGAAAGGAA GCTGAGTTGG 1440 

CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG GGCCTCTAAA CGGGTCTTGA 1500 

GGGGTTTTTT GCTGAAAGGA GGAACCGCTC TTCACGCTCT TCACGC 1546 
15 (2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

3 0 TATGAAACAT CATCACCATC ACCATCATGC TAGCGTTAAC GCGTTGG 47 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (ii) MOLECXXLE TYPE: cDNA 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

AATTCCAACG CGTTAACGCT AGCATGATGG TGATGGTGAT GATGTTTCA 49 



50 



(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 141 base- pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
CTAATTCCGC TCTCACCTAC CAAACAATGC CCCCCTGCAA AAAATAAATT CATATAAAAA 60 
15 ACATACAGAT AACCATCTGC GGTGATAAAT TATCTCTGGC GGTGTTGACA TAAATACCAC 12 0 

TGGCGGTGAT ACTGAGCACA T 2_4-j_ 
(2) INFORiyiATION FOR SEQ ID NO: 54: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CGATGTGCTC AGTATCACCG CCAGTGGTAT TTATGTCAAC ACCGCCAGAG ATAATTTATC 60 

35 

ACCGCAGATG GTTATCTGTA TGTTTTTTAT ATGAATTTAT TTTTTGCAGG GGGGCATTGT 120 
• TTGGTAGGTG AGAGCGGAAT TAGACGT 147 
40 (2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS": 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
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15 



35 



CGATTTGATT CTAGAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GGTAC 55 



(2) INFORMATION FOR SEQ ID NO: 56: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: cDNA 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CGAATTCCAA CGCGTTAACC ATATGTTATT CCTCCTTCTA GAATCAAAT 49 
25 (2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

3 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
40 GTGAAGAGCG TGAAGAGCGG TTCCTCCTTT CAGCAAAAAA CCCCTCAAGA CCCGTTTAGA 

GGCCCCAAGG GGTTATGCTA GTTATTGCTC AGCGGTGGCA GCAGCCAACT CAGCTTCCTT 120 

TCGGGCTTTC TTCTTCTTCT TCTTCTTTCC GCGGATCCTC GAGTAAGCTT CCATGGTACC 180 

45 

CTGCAGGTCG ACACTAGTGA GCTCGAATTC CAACGCGTTA ACCATATGTT ATTCCTCCTT 240 

TAATTAGTTA ACTCAAATCT AGAATCAAAT CGATAAATTG TGAGCGCTCA CAATTGAGAA 3 00 

50 TATTAATCAA GAATTTTAGC ATTTGTCAAA TGAATTTTTT AAAAATTATG AGACGTCCAT 3 60 

ATTTGAATGT ATTTAGAAAA ATAAACAAAA GAGTTTGTAG AAACGCAAAA AGGCCATCCG 42 0 
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10 



20 



25 



TCAGGATGGC CTTCTGCTTA ATTTGATGCC TGGCAGTTTA TGGCGGGCGT CCTGCCCGCC 480 

ACCCTCCGGG CCGTTGCTTC GCAACGTTCA AATCCGCTCC CGGCGGATTT GTCCTACTCA 540 

GGAGAGCGTT CACCGACAAA CAACAGATAA AACGAAAGGC CCAGTCTTTC GACTGAGCCT 600 

TTCGTTTTAT TTGATGCCTG GCAGTTCCCT ACTCTCGCAT GGGGAGACCA TGCATACGTT 660 
ACGCACGT 

(2) INFORMATION FOR SEQ ID NO: 58: 



{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



668 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GCGTAACGTA TGCATGGTCT CCCCATGCGA GAGTAGGGAA CTGCCAGGCA TCAAATAAAA 60 

CGAAAGGCTC AGTCGAAAGA CTGGGCCTTT CGTTTTATCT GTTGTTTGTC GGTGAACGCT 120 

3 0 CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG TTGCGAAGCA ACGGCCCGGA 180 

GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC AAATTAAGCA GAAGGGGCCT 240 

CCCACCGCCC GTCCTGCGGG CGGTATTTGA CGGTCCGTAG TTTAATTCGT CTTCGCCATC 300 

35 

CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTTTGT TTATTTTTCT AAATACATTC 3 60 

AAATATGGAC GTCTCATAAT TTTTAAAAAA TTCATTTGAC AAATGCTAAA ATTCTTGATT 420 

40 AATATTCTCA ATTGTGAGCG CTCACAATTT ATCGATTTGA TTCTAGATTT GTTTTAACTA 480 

ATTAAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GAGCTCACTA GTGTCGACCT 540 

GCAGGGTACC ATGGAAGCTT ACTCGAGGAT CCGCGGAAAG AAGAAGAAGA AGAAGAAAGC 600 

45 

CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG 660 

GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACCGCTC TTCACGCTCT 72 0 

50 TCACGC 72 6 
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(2) INFORMATION FOR SEQ ID NO: 59: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 



20 



30 



35 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
TACGCACTGG ATCCTTATAA GCAGCTTATT TTTACTGATT GGAC 44 
(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GTCCTCCTGG TACCTACCTA AAACAAC 27 
(2) INFORMATION FOR SEQ ID NO: 61: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) ' STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
TATGGATGAA GAAACTTCTC ATCAGCTGCT GTGTGATAAA TGTCCGCCGG GTACCCGGCG 60 
GACATTTATC ACACAGCAGC TGATGAGAAG TTTCTTCATC CA 102 
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(2) INFORMATION FOR SEQ ID NO: 62: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

15 

Met Asp Glu Glu Thr Ser His Gin Leu Leu Cys Asp Lys Cys Pro Pro 
15 10 15 

Gly Thr Tyr 

20 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CPiARACTERISTICS : 
25 (A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: cDNA 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

TATGGAAACT TTTCCTCCAA AATATCTTCA TTATGATGAA GAAACTTCTC ATCAGCTGCT 60 
GTGTGATAAA TGTCCGCCGG GTAC 04 

40 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 78 base pairs 
45 (B) TYPE: nucleic acid 

CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 64: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GAGAAGTTTC TTCATCATAA TGAAGATATT 
5 TTGGAGGAAA AGTTTCCA 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 44 base pairs 

CB) TYPE: nucleic acid 
- (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

TACGCACTGG ATCCTTATAA GCAGCTTATT TTCACGGATT GAAC 44 
(2) INFORMATION FOR SEQ ID NO: 66: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 {D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GTGCTCCTGG TACCTACCTA AAACAGCACT GCACAGTG 3 8 

(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS:- 
(A) LENGTH: 84 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TATGGAAACT CTGCCTCCAA AATACCTGCA TTACGATCCG GAAACTGGTC ATCAGCTGCT 6 0 

5 GTGTGATAAA TGTGCTCCGG GTAC 84 

(2) INFORMATION FOR SEQ ID NO; 68; 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 68: 

CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC CGGATCGTAA TGCAGGTATT 60 
TTGGAGGCAG AGTTTCCA 78 
(2) INFORMATION FOR SEQ ID NO: 69: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

40 TATGGACCCA GAAACTGGTC ATCAGCTGCT GTGTGATAAA TGTGCTCCGG GTAC 54 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC TGGGTCCA 48 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 87 base pairs 
10 (B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

20 

TATGAAAGAA ACTCTGCCTC CAAAATACCT GCATTACGAT CCGGAAACTG GTCATCAGCT 60 
GCTGTGTGAT AAATGTGCTC CGGGTAC 87 
25 (2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

3 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



45 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
40 CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC CGGATCGTAA TGCAGGTATT 60 
TTGGAGGCAG AGTTTCTTTC A 81 



(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GTTCTCCTCA TATGAAACAT CATCACCATC ACCATCATGA AACTCTGCCT CCAAAATACC 60 
TGCATTACGA T 

(2) INFORMATION FOR SEQ ID NO: 74: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STHANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



40 



(ii) MOLECULE TYPE: cDNA 



25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GTTCTCCTCA TATGAAAGAA ACTCTGCCTC CAAAATACCT GCA 43 
3 0 (2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

3 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

TACGCACTGG ATCCTTAATG ATGGTGATGG TGATGATGTA AGCAGCTTAT TTTCACGGAT 60 

45 TGAACCTGAT TCCCTA 76 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear . 
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10 



20 



25 



35 



(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
GTTCTCCTCA TATGAAATAC CTGCATTACG ATCCGGAAAC TGGTCAT 47 

(2) INFORMATION FOR SEQ ID NO: 77: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GTTCTCCTAT TAATGAAATA TCTTCATTAT GATGAAGAAA CTT 43 
(2) INFORMATION FOR SEQ ID NO: 78: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
TACGCACTGG ATCCTTATAA GCAGCTTATT TTTACTGATT 40 
45 (2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
50 .(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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20 



25 



35 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 
GTTCTCCTCA TATGGAAACT CTGCCTCCAA AATACCTGCA 40 
(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TACGCACTGG ATCCTTATGT TGCATTTCCT TTCTGAATTA GCA 43 
(2) INFORMATION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CCGGAAACAG ATAATGAG 13 
(2) INFORMATION FOR SEQ ID NO: 82: * 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

5 GATCCTCATT ATCTGTTT 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CCGGAAACAG AGAAGCCACG CAAAAGTAAG 3 0 

(2) INFORMATION FOR SEQ ID NO: 84: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GATCCTTACT TTTGCGTGGC TTCTCTGTTT 30 
(2) INFORMATION FOR SEQ ID NO: 85: 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
TATGTTAATG AG 
5 (2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

20 GATCCTCATT AACA 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: cDNA 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

TATGTTCCGG AAACAGTTAA G 21 
(2) INFORMATION FOR SEQ ID NO: 88: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
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10 



GATCCTTAAC TGTTTCCGGA ACA 23 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
TATGTTCCGG AAACAGTGAA TCAACTCAAA AATAAG 3 6 

20 (2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

35 GATCCTTATT TTTGAGTTGA TTCACTGTTT CCGGAACA 38 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ' TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
50 CTAGCGACGA CGACGACAAA GAAACTCTGC CTCCAAAATA CCTGCATTAC GATCCGGAAA 
CTGGTCATCA GCTGCTGTGT GATAAATGTG CTCCGGGTAC 



60 
100 
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15 



20 
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(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC CGGATCGTAA TGCAGGTATT 60 
TTGGAGGCAG AGTTTCTTTG TCGTCGTCGT CG- 92 

(2) INFORMATION FOR SEQ ID NO: 93: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
ACAAACACAA TCGATTTGAT ACTAGA 2 6 

40 (2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
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TTTGTTTTAA CTAATTAAAG GAGGAATAAA ATATGAGAGG ATCGCATCAC 50 
(2) INFORMATION FOR SEQ ID NO: 95: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95; 
CATCACCATC ACGAAACCTT CCCGCCGAAA TACCTGCACT ACGACGAAGA 50 
(2) INFORMATION FOR SEQ ID NO: 96: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 base pairs 
2 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



35 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
AACCTCCCAC CAGCTGCTGT^ GCGACAAATG CCCGCCGGGT ACCCAAACA 49 
(2) INFORMATION FOR SEQ ID NO: 97: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single' 

(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: cDNA 



50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
TGTTTGGGTA CCCGGCGGGC ATTTGT 2 6 
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(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: cDNA 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGCACAGCAG CTGGTGGGAG GTTTCTTCGT CGTAGTGCAG GTATTTCGGC 50 
{2) INFORMATION FOR SEQ ID NO: 99: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GGGAAGGTTT CGTGATGGTG ATGGTGATGC GATCCTCTCA TATTTTATT 49 
(2) INFORMATION FOR SEQ ID NO: 100: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: cDNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
CCTCCTTTAA TTAGTTAAAA CAAATCTAGT ATCAAATCGA TTGTGTTTGT 



50 
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(2) INFORMATION FOR SEQ ID NO:101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

15 ACAAACACAA TCGATTTGAT ACTAGATTTG TTTTAACTAA TTAAAGGAGG AATAAAATG 59 

(2) INFORMATION FOR SEQ ID NO: 102: ■ 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

CTAATTAAAG GAGGAATAAA ATGAAAGAAA CTTTTCCTCC AAAATATC 48 
(2) INFORMATION FOR SEQ ID NO: 103: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: -31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TGTTTGGGTA CCCGGCGGAC ATTTATCACA C 31 
(2) INFORMATION FOR SEQ ID NO: 104: 
(i) SEQUENCE CHARACTERISTICS: 



45 



50 
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(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
ACAAACACAA TCGATTTGAT ACTAGATTTG TTTTAACTAA TTAAAGGAGG AATAAAATG 59 
15 (2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

3 0 CTAATTAAAG GAGGAATAAA ATGAAAAAAA AAGAAACTTT TCCTCCAAAA TATC 54 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

40 (ii) MOLECULE TYPE: cDNA 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

TGTTTGGGTA CCCGGCGGAC ATTTATCACA C 31 
(2) INFORMATION FOR SEQ ID NO: 107: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
CAGCCCGGGT AAAATGGAAA CGTTTCCTCC AAAATATCTT CATT 44 
(2) INFORMATION FOR SEQ ID NO: 108: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



20 



(ii) MOLECULE TYPE: cDNA 



25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
CGTTTCCATT TTACCCGGGC TGAGCGAGAG GCTCTTCTGC GTGT 44 
3 0 (2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

45 CGCTCAGCCC GGGTAAAATG GAAACGTTGC CTCCAAAATA CCTGC 45 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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10 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
CCATTTTACC CGGGCTGAGC GAGAGGCTCT TCTGCGTGT 39 
(2) INFORMATION FOR SEQ ID NO: 111: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 6 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111- 

25 

GAAAATAAGC TGCTTAGCTG CAGCTGAACC AAAATC 
(2) INFORMATION FOR SEQ ID NO: 112: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
CAGCTGCAGC TAAGCAGCTT ATTTTCACGG ATTG 
45 (2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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10 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:113: 
AAAAATAAGC TGCTTAGCTG CAGCTGAACC AAAATC 3 6 

(2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
CAGCTGCAGC TAAGCAGCTT ATTTTTACTG ATTGG 35 
(2) INFORMATION FOR SEQ ID NO: 115: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

40 

CTAGAAGGAG GAATAACATA TGGAAACTTT TGCTCCAAAA TATCTTCATT ATGATGAAGA 60 
AACTAGTCAT CAGCTGCTGT GTGATAAATG TCCGCCGGGT AC 102 
45 (2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 
50 .(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 116: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GACTAGTTTC TTCATCATAA TGAAGATATT 

10 

TTGGAGCAAA AGTTTCCATA TGTTATTCCT CCTT 
(2) INFORMATION FOR SEQ ID NO: 117: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: CDNA 



25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
CTAGAAGGAG GAATAACATA TGGAAACTTT TCCTGCTAAA TATCTTCATT ATGATGAAGA 60 
30 AA ^2 

(2) INFORMATION FOR SEQ ID NO: 118: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: cDNA 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
CTAGTTTCTT CATCATAATG AAGATATTTA GCAGGAAAAG TTTCCATATG TTATTCCTCC 60 
50 TT 

(2) INFORMATION FOR SEQ ID NO: 119: 



62 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Tyr His Tyr Tyr Asp Gin Asn Gly Arg Met Cys Glu Glu Cys His Met 
15 ' 

Cys Gin Pro Gly His Phe Leu Val Lys His Cys Lys Gin Pro Lys Arg 
20 25 30 

Asp Thr Val Cys His Lys Pro Cys Glu Pro Gly Val Thr Tyr Thr Asp 
20 35 40 45 



25 



35 



40 



45 



Asp Trp His 
50 



(2) INFORMATION FOR SEQ ID NO: 120: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2432 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 124., 1326 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
ATCAAAGGCA GGGCATACTT CCTGTTGCCC AGACCTTATA TAAAACGTCA TGTTCGCCTG 60 
GGCAGCAGAG AAGCACCTAG CACTGGCCCA GCGGCTGCCG CCTGAGGTTT CCAGAGGACC 12 0 



ACA ATG AAC AAG TGG CTG TGC TGT GCA CTC CTG GTG TTC TTG GAC ATC 168 
Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Phe Leu Asp lie 
50 1 5 10 15 
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10 



30 



50 



ATT GAA TGG ACA ACC CAG GAA ACC TTT CCT CCA AAA TAG TTG CAT TAT 216 
lie Glu Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr 
20 25 30 



GAG CCA GAA ACC GGA CGT CAG CTC TTG TGT GAC AAA TGT GCT CCT GGC 2 64 

Asp Pro Glu Thr Gly Arg Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly 
35 40 45 

ACC TAG CTA AAA CAG CAC TGC ACA GTC AGG AGG AAG ACA CTG TGT GTC 312 
Thr Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val 
50 55 60 



15 CCT TGC CCT GAC TAC TCT TAT ACA GAC AGC TGG CAC ACG AGT GAT GAA 
Pro Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu 
65 70 75 



3 5 AGA TGT CCG GAT GGG TTC TTC TCA GGT GAG ACG TCA TGG AAA GCA CCC 
Arg Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro 
145 150 155 



360 



TGC GTG TAC TGC AGC CCC GTG TGC AAG GAA CTG CAG ACC GTG AAA CAG 408 
2 0 Cys Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Thr Val Lys Gin 
80 85 90 95 

GAG TGC AAC CGC ACC CAC AAC CGA GTG TGC GAA TGT GAG GAA GGG CGC 45 6 

Glu Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg 
25 100 105 110 

TAC CTG GAG CTC GAA TTC TGC TTG AAG CAC CGG AGC TGT CCC CCA GGC 504 
Tyr Leu Glu Leu Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly 
115 120 125 



TTG GGT GTG CTG CAG GCT GGG ACC CCA GAG CGA AAC ACG GTT TGC AAA 5 52 

Leu Gly Val Leu Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys 
130 135 140 



600 



TGT AGG AAA CAC ACC AAC TGC AGC TCA CTT GGC CTC CTG CTA ATT CAG 648 
40 Cys Arg Lys His Thr Asn Cys Ser Ser Leu Gly Leu Leu Leu lie Gin 
160 165 170 175 

AAA GGA AAT GCA ACA CAT GAC AAT GTA TGT TCC GGA AAC AGA GAA GCA 696 
Lys Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala 
45 180 185 190 

ACT CAA AAT TGT GGA ATA GAT GTC ACC CTG TGC GAA GAG GCA TTC TTC 744 
Thr Gin Asn Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe 
195 200 205 



AGG TTT GCT GTG CCT ACC AAG ATT ATA CCG AAT TGG CTG AGT GTT CTG 792 
Arg Phe Ala Val Pro Thr Lys lie lie Pro Asn Trp Leu Ser Val Leu 
210 215 220 
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GTG GAC ACT TTG CCT GGG ACC AAA GTG AAT GCA GAG AGT GTA GAG AGG 840 
Val Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg 
225 230 235 

5 

ATA AAA CGG AGA CAC AGC TCG CAA GAG CAA ACT TTG CAG CTA CTT AAG 888 
He Lys Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys 
240 245 250 255 

10 CTG TGG AAG CAT CAA AAC AGA GAC CAG GAA ATG GTG AAG AAG ATC ATC 93 6 

Leu Trp Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys He He 
260 265 270 



CAA GAC ATT GAC CTC TGT GAA AGC AGT GTG CAA CGG CAT ATC GGC CAC 
15 Gin Asp He Asp Leu Cys Glu Ser Ser Val Gin Arg His He Gly His 
275 280 285 



984 



GCG AAC CTC ACC ACA GAG CAG CTC CGC ATC TTG ATG GAG AGC TTG CCT 103 2 

Ala Asn Leu Thr Thr Glu Gin Leu Arg He Leu Met Glu Ser Leu Pro 
20 290 295 300 

GGG AAG AAG ATC AGC CCA GAC GAG ATT GAG AGA ACG AGA AAG ACC TGC 1080 
Gly Lys Lys He Ser Pro Asp Glu He Glu Arg Thr Arg Lys Thr Cys 
305 310 315 

25 

AAA CCC AGC GAG CAG CTC CTG AAG CTA CTG AGC TTG TGG AGG ATC AAA 112 8 

Lys Pro Ser Glu Gin Leu Leu Lys Leu Leu Ser Leu Trp Arg He Lys 
320 325 330 335 

3 0 AAT GGA GAC CAA GAC ACC TTG AAG GGC CTG ATG TAG GCA CTC AAG CAC 1176 
Asn Gly Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His 
340 345 350 

TTG AAA GCA TAG CAC TTT CCC AAA ACC GTC ACC CAC AGT CTG AGG AAG 12 24 

3 5 Leu Lys Ala Tyr His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys 
355 360 365 

ACC ATC AGG TTC TTG CAC AGC TTC ACC ATG TAC CGA TTG TAT CAG AAA 1272 
Thr He Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys 
40 370 375 380 

CTC TTT CTA GAA ATG ATA GGG AAT CAG GTT CAA TCA GTG AAG ATA AGC 1320 
Leu Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser 
385 390 395 

45 

TGC TTA TAGTTAGGAA TGGTCACTGG GCTGTTTCTT CAGGATGGGC CAACACTGAT 1376 

Cys Leu 

400 

50 GGAGCAGATG GCTGCTTCTC CGGCTCTTGA AATGGCAGTT GATTCCTTTC TCATCAGTTG 143 6 
GTGGGAATGA AGATCCTCCA GCCCAACACA CACACTGGGG AGTCTGAGTC AGGAGAGTGA 1496 



10 
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GGCAGGCTAT TTGATAATTG TGCAAAGCTG CCAGGTGTAC ACCTAGAAAG TCAAGCACCC 1556 

TGAGAAAGAG GATATTTTTA TAACCTCAAA CATAGGCCCT TTCCTTCCTC TCCTTATGGA 1616 

TGAGTACTCA GAAGGCTTCT ACTATCTTCT GTGTCATCCC TAGATGAAGG CCTCTTTTAT 167 6 

TTATTTTTTT ATTCTTTTTT TCGGAGCTGG GGACCGAACC CAGGGCCTTG CGCTTGCGAG 173 6 

GCAAGTGCTC TACCACTGAG CTAAATCTCC AACCCCTGAA GGCCTCTTTC TTTCTGCCTC 1796 

TGATAGTCTA TGACATTCTT TTTTCTACAA TTCGTATCAG GTGCACGAGC CTTATCCCAT 1856 

TTGTAGGTTT CTAGGCAAGT TGACCGTTAG CTATTTTTCC CTCTGAAGAT TTGATTCGAG 1916 

15 TTGCAGACTT GGCTAGACAA GCAGGGGTAG GTTATGGTAG TTTATTTAAC AGACTGCCAC 1976 

CAGGAGTCCA GTGTTTCTTG TTCCTCTGTA GTTGTACCTA AGCTGACTCC AAGTACATTT 2 036 

AGTATGAAAA ATAATCAACA AATTTTATTC CTTCTATCAA CATTGGCTAG CTTTGTTTCA 2 09 6 

20 

GGGCACTAAA AGAAACTACT ATATGGAGAA AGAATTGATA TTGCCCCCAA CGTTCAACAA 2156 

CCCAATAGTT TATCCAGCTG TCATGCCTGG TTCAGTGTCT ACTGACTATG CGCCCTCTTA '2216 

25 TTACTGCATG CAGTAATTCA ACTGGAAATA GTAATAATAA TAATAGAAAT AAAATCTAGA 2276 

CTCCATTGGA TCTCTCTGAA TATGGGAATA TCTAACTTAA GAAGCTTTGA GATTTCAGTT 233 6 

GTGTTAAAGG CTTTTATTAA AAAGCTGATG CTCTTCTGTA AAAGTTACTA ATATATCTGT 23 9 6 

AAGACTATTA CAGTATTGCT ATTTATATCC ATCCAG 2432 



^30 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Phe Leu Asp lie lie 
50 1 , 5 10 15 

Glu Trp Thr Thr Gin- Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 
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Pro Glu Thr Gly Arg Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr 

35 40 45 

5 Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro 

50 55 60 



10 



25 



40 



Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 

Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Thr Val Lys Gin Glu 
85 90 95 



Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr 
15 100 105 110 

Leu Glu Leu Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Leu 
115 120 125 

20 Gly Val Leu Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 140 



Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys 

145 150 155 160 

Arg Lys His Thr Asn Cys Ser Ser Leu Gly Leu Leu Leu lie Gin Lys 

165 170 175 



Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr 
30 180 185 190 

Gin Asn Cys Gly He Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

35 Phe Ala Val Pro Thr Lys He He Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 



Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg He 
225 230 235 240 

Lys Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 
245 250 255 



Trp Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys He He Gin 

45 260 265 270 

Asp He Asp Leu Cys Glu Ser Ser Val Gin Arg His He Gly His Ala 
275 280 285 

50 Asn Leu Thr Thr Glu Gin Leu Arg He Leu Met Glu Ser Leu Pro Gly 
290 295 300 
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10 



15 



20 



ys Lys He Ser Pro Asp Glu He Glu Arg Thr Arg Lys Thr Cys Lys 
305 310 315 320 

Pro Ser Glu Gin Leu Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn 
325 330 335 

Gly Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu 
340 345 350 

Lys Ala Tyr His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr 
355 360 365 

He Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys Leu 
370 375 380 

Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys 
385 390 395 400 

Leu 



(2) INFORMATION FOR SEQ ID NO: 122: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
35 (B) LOCATION: 90.. 1292 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
40 CCTTATATAA ACGTCATGAT TGCCTGGGCT GCAGAGACGC ACCTAGCACT GACCCAGCGG 



45 



50 



60 



CTGCCTCCTG AGGTTTCCCG AGGACCACA ATG AAC AAG TGG CTG TGC TGC GCA 113 

Met Asn Lys Trp Leu Cys Cys Ala 
1 5 



CTC CTG GTG CTC CTG GAC ATC ATT GAA TGG ACA ACC CAG GAA ACC CTT 161 
Leu Leu Val Leu Leu Asp He He Glu Trp Thr Thr Gin Glu Thr Leu 
10 15 20 

CCT CCA AAG TAC TTG CAT TAT GAC CCA GAA ACT GGT CAT CAG CTC CTG 209 
Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly His Gin Leu Leu 
25 30 ^ 35 40 
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TGT GAC AAA TGT OCT CCT GGC ACC TAG CTA AAA CAG CAC TGC ACA GTG 2 57 

Cys Asp Lys Cys Ala Pro Gly Thr Tyr Leu Lys Gin His Cys Thr Val 

45 50 55 

5 

AGG AGG AAG ACA TTG TGT GTC CCT TGC CCT GAC CAC TCT TAT ACG GAC 3 05 

Arg Arg Lys Thr Leu Cys Val Pro Cys Pro Asp His Ser Tyr Thr Asp 

60 65 70 

10 AGC TGG CAC ACC AGT GAT GAG TGT GTG TAT TGC AGC CCA GTG TGC AAG 353 
Ser Trp His Thr Ser Asp Glu Cys Val Tyr Cys Ser Pro Val Cys Lys 
75 80 85 

GAA CTG CAG TCC GTG AAG CAG GAG TGC AAC CGC ACC CAC AAC CGA GTG 401 
15 Glu Leu Gin Ser Val Lys Gin Glu Cys Asn Arg Thr His Asn Arg Val 
90 95 100 

TGT GAG TGT GAG GAA GGG CGT TAC CTG GAG ATC GAA TTC TGC TTG AAG 449 
Cys Glu Cys Glu Glu Gly Arg Tyr Leu Glu lie Glu Phe Cys Leu Lys 
20 105 110 115 120 

CAC CGG AGC TGT CCC CCG GGC TCC GGC GTG GTG CAA GCT GGA ACC CCA 497 
His Arg Ser Cys Pro Pro Gly Ser Gly Val Val Gin Ala Gly Thr Pro 
125 130 135 

25 

GAG CGA AAC ACA GTT TGC AAA AAA TGT CCA GAT GGG TTC TTC TCA GGT 545 
Glu Arg Asn Thr Val Cys Lys Lys Cys Pro Asp Gly Phe Phe Ser Gly 
140 145 150 

3 0 GAG ACT TCA TCG AAA GCA CCC TGT ATA AAA CAC ACG AAC TGC AGC ACA 593 
Glu Thr Ser Ser Lys Ala Pro Cys He Lys His Thr Asn Cys Ser Thr 
155 160 165 

TTT GGC CTC CTG CTA ATT CAG AAA GGA AAT GCA ACA CAT GAC AAC GTG 641 
3 5 Phe Gly Leu Leu Leu He Gin Lys Gly Asn Ala Thr His Asp Asn Val 
170 175 180 

TGT TCC GGA AAC AGA GAA GCC ACG CAA AAG TGT GGA ATA GAT GTC ACC 689 
Cys Ser Gly Asn Arg Glu Ala Thr Gin Lys Cys Gly He Asp Val Thr 
40 185 190 195 200 

CTG TGT GAA GAG GCC TTC TTC AGG TTT GCT GTT CCT ACC AAG ATT ATA 737 
Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr Lys He He 
205 210 215 

45 

CCA AAT TGG CTG AGT GTT TTG GTG GAC AGT TTG CCT GGG ACC AAA GTG 785 
Pro Asn Trp Leu Ser Val Leu Val Asp Ser Leu Pro Gly Thr Lys Val 
220 225 230 

50 AAT GCC GAG AGT GTA GAG AGG ATA AAA CGG AGA CAC AGC TCA CAA GAG 833 
Asn Ala Glu Ser Val Glu Arg He Lys Arg Arg His Ser Ser Gin Glu 
235 240 245 
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10 



30 



40 



45 



CAA ACC TTC CAG CTG CTG AAG CTG TGG AAA CAT CAA AAC AGA GAC CAG 
Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn Arg Asp Gin 
250 255 260 



881 



GAA ATG GTG AAG AAG ATC ATC CAA GAC ATT GAC CTC TGT GAA AGC AGC 929 
Glu Met Val Lys Lys He He Gin Asp He Asp Leu Cys Glu Ser Ser 
265 270 275 280 

GTG CAG CGG CAT CTC GGC CAC TCG AAC CTC ACC ACA GAG CAG CTT CTT 977 
Val Gin Arg His Leu Gly His Ser Asn Leu Thr Thr Glu Gin Leu Leu 
285 290 295 

GCC TTG ATG GAG AGC CTG CCT GGG AAG AAG ATC AGC CCA GAA GAG ATT 1025 
Ala Leu Met Glu Ser Leu Pro Gly Lys Lys He Ser Pro Glu Glu He 
15 300 305 310 

GAG AGA ACG AGA AAG ACC TGC AAA TCG AGC GAG CAG CTC CTG AAG CTA 1073 
Glu Arg Thr Arg Lys Thr Cys Lys Ser Ser Glu Gin Leu Leu Lys Leu 
315 320 325 

20 

CTC AGT TTA TGG AGG ATC AAA AAT GGT GAC CAA GAC ACC TTG AAG GGC 1121 
Leu Ser Leu Trp Arg He Lys Asn Gly Asp Gin Asp Thr Leu Lys Gly 
330 335 340 

25 CTG ATG TAT GCC CTC AAG CAC TTG AAA ACA TCC CAC TTT CCC AAA ACT 1169 
Leu Met Tyr Ala Leu Lys His Leu Lys Thr Ser His Phe Pro Lys Thr 
345 350 355 360 



GTC ACC CAC AGT CTG AGG AAG ACC ATG AGG TTC CTG CAC AGC TTC ACA 1217 
Val Thr His Ser Leu Arg Lys Thr Met Arg Phe Leu His Ser Phe Thr 
365 370 375 

ATG TAC AGA CTG TAT CAG AAG CTC TTT TTA GAA ATG ATA GGG AAT CAG 1265 
Met Tyr Arg Leu Tyr Gin Lys Leu Phe Leu Glu Met He Gly Asn Gin 
35 380 385 390 

GTT CAA TCC GTG AAA ATA AGC TGC TTA TAACTAGGAA TGGTCACTGG 1312 
Val Gin Ser Val Lys He Ser Cys Leu 
395 400 



GCTGTTTCTT CA ^324 



(2) INFORMATION FOR SEQ ID NO: 123: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 
50 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SSQ ID NO:123: 

Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Leu Leu Asp lie lie 
5 1 5 10 15 

Glu Trp Thr Thr Gin Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

10 Pro Glu Thr Gly His Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr 
35 40 45 



15 



30 



45 



Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro 
50 55 60 

Cys Pro Asp His Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 



Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Ser Val Lys Gin Glu 
20 85 90 95 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr 
100 105 110 

2 5 Leu Glu lie Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Ser 
115 120 125 



Gly Val Val Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Lys 
130 135 140 

Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 



He Lys His Thr Asn Cys Ser Thr Phe Gly Leu Leu Leu He Gin Lys 
35 165 170 175 

Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr 
180 185 190 

40 Gin Lys Cys Gly He Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 



Phe Ala Val Pro Thr Lys He He Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 

Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg He 
225 230 235 240 



Lys Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 
50 245 250 255 



Trp Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys He He Gin 
260 265 270 
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Asp lie Asp 
275 

5 Asn Leu Thr 
290 

Lys Lys lie 
305 

10 

Ser Ser Glu 



Gly Asp Gin 

15 

Lys Thr Ser 
355 

20 Met Arg Phe 
370 

Phe Leu Glu 
385 

25 

Leu 



Leu Cys Glu Ser 



Thr Glu Gin Leu 
295 

Ser Pro Glu Glu 
310 

Gin Leu Leu Lys 
325 

Asp Thr Leu Lys 
340 

His Phe Pro Lys 



Leu His Ser Phe 
375 

Met lie Gly Asn 
390 



Ser Val Gin Arg 
280 

Leu Ala Leu Met 



lie Glu Arg Thr 
315 

Leu Leu Ser Leu 
330 

Gly Leu Met Tyr 
345 

Thr Val Thr His 
360 

Thr Met Tyr Arg 



Gin Val Gin Ser 
395 



His Leu Gly His Ser 
285 

Glu Ser Leu Pro Gly 
300 

Arg Lys Thr Cys Lys 
320 

Trp Arg lie Lys Asn 
335 

Ala Leu Lys His Leu 
350 

Ser Leu Arg Lys Thr 
365 

Leu Tyr Gin Lys Leu 
380 

Val Lys lie Ser Cys 
400 



(2) INFORMATION FOR SEQ ID NO: 124: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
3 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



40 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 94.. 1296' 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

GTATATATAA CGTGATGAGC GTACGGGTGC GGAGACGCAC CGGAGCGCTC GCCCAGCCGC 60 

50 CGCTCCAAGC CCCTGAGGTT TCCGGGGACC ACA ATG AAC AAG TTG CTG TGC TGC 114 

Met Asn Lys Leu Leu Cys Cys 
1 5 
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10 



20 



30 



GCG CTC GTG TTT CTG GAG ATC TCC ATT AAG TGG ACC ACC GAG GAA ACG 162 
Ala Leu Val Phe Leu Asp lie Ser He Lys Trp Thr Thr Gin Glu Thr 
10 15 20 

TTT OCT CCA AAG TAG CTT CAT TAT GAG GAA GAA ACC TCT CAT GAG CTG 210 
Phe Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His Gin Leu 
25 30 35 

TTG TGT GAC AAA TGT CCT CCT GGT ACC TAC CTA AAA CAA CAC TGT ACA 258 
Leu Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gin His Cys Thr 
40 45 50 55 



GCA AAG TGG AAG ACC GTG TGC GCC CCT TGC CCT GAC CAC TAC TAC ACA 
Ala Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr 
15 60 65 70 



AAG GAG CTG CAG TAC GTC AAG CAG GAG TGC AAT CGC ACC CAC AAC CGC 
Lys Glu Leu Gin Tyr Val Lys Gin Glu Cys Asn Arg Thr His Asn Arg 
90 95 100 

25 GTG TGC GAA TGC AAG GAA GGG CGC TAC CTT GAG ATA GAG TTC TGC TTG 
Val Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu He Glu Phe Cys Leu 
105 110 115 



AAA CAT AGG AGC TGC CCT CCT GGA TTT GGA GTG GTG CAA GCT GGA ACC 
Lys His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gin Ala Gly Thr 
120 125 130 • 135 



306 



GAC AGC TGG CAC ACC AGT GAC GAG TGT CTA TAC TGC AGC CCC GTG TGC 354 
Asp Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro Val Cys 
75 80 85 



402 



450 



498 



CCA GAG CGA AAT ACA GTT TGC AAA AGA TGT CCA GAT GGG TTC TTC TCA 546 
Pro Glu Arg Asn Thr Val Cys Lys Arg Cys Pro Asp Gly Phe Phe Ser 
35 140 145 150 

AAT GAG ACG TCA TCT AAA GCA CCC TGT AGA AAA CAC ACA AAT TGC AGT 594 
Asn Glu Thr Ser Ser Lys Ala Pro Cys Arg Lys His Thr Asn Cys Ser 
40 155 160 165 

GTC TTT GGT CTC CTG CTA ACT CAG AAA GGA AAT GCA ACA CAC GAC AAC 642 
Val Phe Gly Leu Leu Leu Thr Gin Lys Gly Asn Ala Thr His Asp Asn 
170 175 180 

45 

ATA TGT TCC GGA AAC AGT GAA TCA ACT CAA AAA TGT GGA ATA GAT GTT 690 
He Cys Ser Gly Asn Ser Glu Ser Thr Gin Lys Cys Gly He Asp Val 
185 190 195 

50 ACC CTG TGT GAG GAG GCA TTC TTC AGG TTT GCT GTT CCT ACA AAG TTT 738 
Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr Lys Phe 
200 205 210 215 
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20 



30 



ACG CCT AAC TGG CTT ACT GTC TTG GTA GAG AAT TTG CCT GGC ACC AAA 
Thr Pro Asn Trp Leu Ser Val Leu Val Asp Asn Leu Pro Gly Thr Lys 
220 225 230 



25 ATT GAA AAA ACA ATA AAG GCA TGC AAA CCC AGT GAG CAG ATC CTG AAG 
He Glu Lys Thr He Lys Ala Cys Lys Pro Ser Asp Gin He Leu Lys 
315 320 325 



786 



GTA AAC GCA GAG AGT GTA GAG AGG ATA AAA CGG CAA CAC AGC TCA CAA 834 
Val Asn Ala Glu Ser Val Glu Arg He Lys Arg Gin His Ser Ser Gin 
235 240 245 

GAA CAG ACT TTC CAG CTG CTG AAG TTA TGG AAA CAT CAA AAC AAA GCC 882 
Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn Lys Ala 
250 255 260 



CAA GAT ATA GTC AAG AAG ATC ATC CAA GAT ATT GAC CTC TGT GAA AAC 93 0 
Gin Asp He Val Lys Lys He He Gin Asp He Asp Leu Cys Glu Asn 
15 265 270 275 

AGC GTG CAG CGG CAC ATT GGA CAT GCT AAC CTC ACC TTC GAG CAG CTT 97 8 
Ser Val Gin Arg His He Gly His Ala Asn Leu Thr Phe Glu Gin Leu 
280 285 290 295 



CGT AGC TTG ATG GAA AGC TTA CCG GGA AAG AAA GTG GGA GCA GAA GAC 102 6 

Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val Gly Ala Glu Asp 
300 305 310 



1074 



CTG CTC AGT TTG TGG CGA ATA AAA AAT GGC GAC CAA GAC ACC TTG AAG 1122 
Leu Leu Ser Leu Trp Arg He Lys Asn Gly Asp Gin Asp Thr Leu Lys 
330 335 340 



GGC CTA ATG CAC GCA CTA AAG CAC TCA AAG ACG TAG CAC TTT CCC AAA 1170 
Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe Pro Lys 
35 345 350 355 

ACT GTC ACT CAG AGT CTA AAG AAG ACC ATC AGG TTC CTT CAC AGC TTC 1218 
Thr Val Thr Gin Ser Leu Lys Lys Thr He Arg Phe Leu His Ser Phe 
360 365 370 375 

40 

ACA ATG TAC AAA TTG TAT CAG AAG TTA TTT TTA GAA ATG ATA GGT AAC 12 66 

Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met He Gly Asn 
380 385 390 

45 CAG GTC CAA TCA GTA AAA ATA AGC TGC TTA TAACTGGAAA TGGCCATTGA 1316 
Gin Val Gin Ser Val Lys He Ser Cys Leu 
395 400 



50 



GCTGTTTCCT CACAATTGGC GAGATCCCAT GGATGATAA 13 55 



(2) INFORMATION FOR SEQ ID NO: 125: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

Met Asn Lys Leu Leu Cys Cys Ala Leu Val Phe Leu Asp He Ser He 
15 10 15 

Lys Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

Glu Glu Thr Ser His Gin Leu Leu Cys Asp Lys Cys Pro Pro Gly Thr 
35 40 45 

Tyr Leu Lys Gin His Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro 
50 55 60 

Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 

Leu Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu 
85 90 95 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr 
100 105 110 

Leu Glu He Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe 
115 120 125 

Gly Val val Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 140 

Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 

Arg Lys His Thr Asn Cys Ser Val Phe Gly Leu Leu Leu Thr Gin Lys 
165 170 175 

Gly Asn Ala Thr His Asp Asn He Cys Ser Gly Asn Ser Glu Ser Thr 
180 185 190 

Gin Lys Cys Gly He Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 



Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg He 
225 230 . 235 240 
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Lys Arg Gin His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 

245 250 255 

5 Trp Lys His Gin Asn Lys Ala Gin Asp He Val Lys Lys He He Gin 
260 265 270 



10 



25 



Asp He Asp Leu Cys Glu Asn Ser Val Gin Arg His He Gly His Ala 
275 280 285 

Asn Leu Thr Phe Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly 
290 295 300 



Lys Lys Val Gly Ala Glu Asp He Glu Lys Thr He Lys Ala Cys Lys 
15 305 310 315 320 

Pro Ser Asp Gin He Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn 
325 330 335 

2 0 Gly Asp Gin Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser 
340 345 350 



Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr 
355 360 365 

He Arg Phe Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu 
370 375 380 



Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys 
30 385 390 395 400 



Leu 



35 (2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 



50 



Cys 
1 



Pro Gin Gly Lys Tyr He His Pro Gin Asn Asn Ser He Cys Cys 
5 10 15 
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Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro 
20 25 30 

Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala 
5 35 40 45 

Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys 
50 55 60 

10 Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr 

65 70 75 80 



15 



35 



40 



45 



Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn 
85 90 95 

Leu Phe Gin Cys Phe Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His 
100 105 110 



Leu Ser Cys Gin Glu Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly 
20 115 120 125 

Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys 
130 135 

25 



(2) INFORMATION FOR SEQ ID NO: 127: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GAGAAGTTTC TTCATCCA 48 



{2) INFORMATION FOR SEQ ID NO: 128: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 8: 

Met Leu Gly He Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala 
15 10 15 

Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr Asp He Asn Ser 
20 25 30 

Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gin Asn 
35 40 45 

Leu Glu Gly Leu His His Asp Gly Gin Phe Cys His Lys Pro Cys Pro 
50 55 60 

Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro 

70 75 80 

Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys Ala His 
85 90 95 

Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly His Gly 
100 105 110 

Leu Glu Val Glu He Asn Cys Thr Arg Thr Gin Asn Thr Lys Cys Arg 
115 120 125 

Cys Lys Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp 
130 135 140 

Pro Cys Thr Lys Cys Glu His Gly He He Lys Glu Cys Thr Leu Thr 
145 150 155 160 

Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp 
165 170 175 

Leu Cys Leu Leu Leu Leu Pro He Pro Leu He Val Trp Val Lys Arg 
180 185 190 

Lys Glu Val Gin Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gin Gly 
195 200 205 

Ser His Glu Ser Pro Thr Leu Asn Pro Glu Thr 
210 215 



INFORMATION FOR SEQ ID NO: 129: 
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10 



15 



T3 0 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 
15 10 15 

Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 
^ 20 25 30 

: His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 

120 35 40 45 

I Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys 
I 50 55 60 

" 2 5 Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 

I 65 70 75 80 

t Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

1 85 90 95 



Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 
100 105 110 



Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
35 115 120 125 

Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 
130 135 140 

40 Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 

145 150 155 160 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 
165 170 175 

45 

Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 
180 185 190 

50 Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser 

195 200 205 
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10 



15 



Gly Thr Thr Val Leu Leu Pro Leu Val He Phe Phe Gly Leu Cys Leu 
210 215 220 

Leu Ser Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 
225 230 235 240 

Ser Lys Leu Tyr Ser He Val Cys Gly Lys Ser Thr Pro Glu Lys Glu 
245 250 255 

Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser 
260 265 270 

Phe Ser Pro Thr Pro Gly Phe Thr 
275 280 



(2) INFORMATION FOR SEQ ID NO: 130 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 



30 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Met Leu Arg Leu He Ala Leu Leu Val Cys Val Val Tyr Val Tyr Gly 
15 10 15 



Asp Asp Val Pro Tyr Ser Ser Asn Gin Gly Lys Cys Gly Gly His Asp 
^5 20 25 30 

Tyr Glu Lys Asp Gly Leu Cys Cys Ala Ser Cys His Pro Gly Phe Tyr 
35 40 45 



Ala Ser Arg Leu Cys Gly Pro Gly Ser Asn Thr Val Cys Ser Pro Cys 
50 55 60 

Glu Asp Gly Thr Phe Thr Ala Ser Thr Asn His Ala Pro Ala Cys Val 
65 70 75 80 

Ser Cys Arg Gly Pro Cys Thr Gly His Leu Ser Glu Ser Gin Pro Cys 
85 90 95 
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10 



15 



Asp Arg Thr His Asp Arg Val Cys Asn Cys Ser Thr Gly Asn Tyr Cys 

100 105 110 

Leu Leu Lys Gly Gin Asn Gly Cys Arg He Cys Ala Pro Gin Thr Lys 

^15 120 125 

Cys Pro Ala Gly Tyr Gly Val Ser Gly His Thr Arg Ala Gly Asp Thr 

130 135 140 

Leu Cys Glu Lys Cys Pro Pro His Thr Tyr Ser Asp Ser Leu Ser Pro 

145 150 155 160 

Thr Glu Arg Cys Gly Thr Ser Phe Asn Tyr He Ser Val Gly Phe Asn 

165 170 175 

Leu Tyr Pro Val Asn Glu Thr Ser Cys Thr Thr Thr Ala Gly His Asn 

180 185 190 



Glu Val He Lys Thr Lys Glu Phe Thr Val Thr Leu Asn Tyr Thr 
20 195 200 205 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: protein 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 

1.5 10 15 

40 Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 

20 25 30 

Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin 
35 40 45 

Thr Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Gly Gin His Ala Lys 
50 55 60 

Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 
50 65 70 75 80 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys 
85 90 95 



45 
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Gly Ser Arg Cys Ser Ser Asp Gin Val Glu Thr Gin Ala Cys Thr Arg 
100 105 110 

Glu Gin Asn Arg He Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 
115 120 125 

Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 135 140 

Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 
. 150 155 160 

Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr 
165 170 175 

Asp He Cys Arg Pro His Gin lie Cys Asn Val Val Ala He Pro Gly 
180 185 190 

Asn Ala Ser Arg Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser 
1^5 200 205 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Ser 
210 215 220 

Gin His Thr 
225 

INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Met Val Ser Leu Pro Arg Leu Cys Ala Leu Trp Gly Cys Leu Leu Thr 
15 10 15 

Ala Val His Leu Gly Gin Cys Val Thr Cys Ser Asp Lys Gin Tyr Leu 
20 25 30 

His Asp Gly Gin Cys Cys Asp Leu Cys Gin Pro Gly Ser Arg Leu Thr 
35 40 45 
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Ser His Cys Thr Ala Leu Glu Lys Thr Gin Cys His Pro Cys Asp Ser 
50 55 60 

Gly Glu Phe Ser Ala Gin Trp Asn Arg Glu He Arg Cys His Gin His 
65 70 75 80 

Arg His Cys Glu Pro Asn Gin Gly Leu Arg Val Lys Lys Glu Gly Thr 
85 90 95 

Ala Glu Ser Asp Thr Val Cys Thr Cys Lys Glu Gly Gin His Cys Thr 
100 105 110 

Ser Lys Asp Cys Glu Ala Cys Ala Gin His Thr Pro Cys He Pro Gly 
115 120 125 

Phe Gly Val Met Glu Met Ala Thr Glu Thr Thr Asp Thr Val Cys His 
130 135 140 

Pro Cys Pro Val Gly Phe Phe Ser Asn Gin Ser Ser Leu Phe Glu Lys 
145 150 155 160 

Cys Tyr Pro Trp Thr Ser Cys Glu Asp Lys Asn Leu Glu Val Leu Gin 
165 170 175 

Lys Gly Thr Ser Gin Thr Asn Val He Cys Gly Leu Lys Ser Arg Met 
180 185 190 

Arg Ala Leu Leu Val 
195 

I) INFORMATION FOR SEQ ID NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



Met Asn Lys Trp Leu Cys Cys Ala 
1 5 

Glu Trp Thr Thr Gin Glu Thr Phe 
20 



Leu Leu Val Phe Leu Asp He He 
10 15 

Pro Pro Lys Tyr Leu His Tyr Asp 
25 30 



Pro Glu Thr Gly Arg Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr 
35 40 45 
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15 



3 0 



Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro 
50 55 60 

Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
S5 70 75 80 

Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Thr Val Lys Gin Glu 
85 90 95 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr 
100 105 110 

Leu Glu Leu Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Leu 
115 120 125 

Gly Val Leu Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 140 



Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys 
20 145 150 155 160 

Arg Lys His Thr Asn Cys Ser Ser Leu Gly Leu Leu Leu lie Gin Lys 
165 170 175 

25 Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr 

180 185 190 



Gin Asn Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 134: 



(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

45 Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu 

15 10 15 



Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys 
50 20 25 30 

Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn 

35 40 45 
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35 



45 



Leu Gly Glu Gly Val Ala Gin Pro Cys Gly Ala Asn Gin Thr Val Cys 
50 55 60 

Glu Pro Cys Leu Asp Ser Val Thr Phe Ser Asp Val Val Ser Ala Thr 
65 70 75 80 

Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gin Ser Met Ser 
85 90 95 

Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala Tyr Gly 
100 105 110 



Tyr Tyr Gin Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys 
15 115 120 125 

Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gin Asp Lys Gin Asn Thr 
130 135 140 

20 Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala Asn His 

145 150 155 160 

Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu Arg Gin 
165 170 175 

25 

Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu He Pro 
180 185 190 

Gly Arg Trp He Thr Arg Ser Thr Pro Pro Glu Gly Ser Asp Ser Thr 
30 195 200 205 

Ala Pro Ser Thr Gin Glu Pro Glu Ala Pro Pro Glu Gin Asp Leu He 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 135: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 



Met Tyr Val T2rp Val Gin Gin Pro Thr Ala Phe Leu Leu Leu Gly Leu 
50 1 5 10 15 



Ser Leu Gly Val Thr Val Lys Leu Asn Qys Val Lys Asp Thr Tyr Pro 
20 ' 25 30 
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10 



25 



45 



Ser Gly His Lys Cys Cys Arg Glu Cys Gin Pro Gly His Gly Met Val 
35 40 45 

Ser Arg Cys Asp His Thr Arg Asp Thr Val Cys His Pro Cys Glu Pro 
50 55 60 

Gly Phe Tyr Asn Glu Ala Val Asn Tyr Asp Thr Cys Lys Gin Cys Thr 
65 70 75 80 

Gin Cys Asn His Arg Ser Gly Ser Glu Leu Lys Gin Asn Cys Thr Pro 
85 90 95 



Thr Glu Asp Thr Val Cys Gin Cys Arg Pro Gly Thr Gin Pro Arg Gin 
15 100 105 110 

Asp Ser Ser His Lys Leu Gly Val Asp Cys Val Pro Cys Pro Pro Gly 
115 120 125 

20 His Phe Ser Pro Gly Ser Asn Gin Ala Cys Lys Pro Trp Thr Asn Cys 

130 135 140 

Thr Leu Ser Gly Lys Gin He Arg His Pro Ala Ser Asn Ser Leu Asp ' 

150 155 160 

Thr Val Cys Glu Asp Arg Ser Leu Leu Ala Thr Leu Leu Trp Glu Thr 
165 170 175 

Gin Arg Thr Thr Phe Arg Pro Thr Thr Val Pro Ser Thr Thr Val Trp 
30 180 185 190 

Pro Arg Thr Ser Gin Leu Pro Ser Thr Pro Thr Leu Val 
195 200 205 

35 (2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

50 Met Gly Asn Asn Cys Tyr Asn Val Val Val He Val Leu Leu Leu Val 

15 10 15 
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10 



15 



25 



30 



35 



Gly Cys Glu Lys Val Gly Ala Val Gin Asn Ser Cys Asp Asn Cys Gin 
20 25 30 

Pro Gly Thr Phe Cys Arg Lys Tyr Asn Pro Val Cys Lys Ser Cys Pro 
35 40 45 

Pro Ser Thr Phe Ser Ser lie Gly Gly Gin Pro Asn Cys Asn lie Cys 
50 55 60 

Arg Val Cys Ala Gly Tyr Phe Arg Phe Lys Lys Phe Cys Ser Ser Thr 
65 70 75 80 

His Asn Ala Glu Cys Glu Cys He Glu Gly Phe His Cys Leu Gly Pro 
85 90 95 

Gin Cys Thr Arg Cys Glu Lys Asp Cys Arg Pro Gly Gin Glu Leu Thr 
100 105 110 



Lys Gin Gly Cys Lys Thr Cys Ser Leu Gly Thr Phe Asn Asp Gin Asn 
20 115 120 125 

Gly Thr Gly Val Cys Arg Pro Trp Thr Asn Cys Ser Leu Asp Gly Arg 
130 135 140 



Ser Val Leu Lys Thr Gly Thr Thr Glu Lys Asp Val Val Cys Gly Pro 
145 150 155 160 

Pro Val Val Ser Phe Ser Pro Ser Thr Thr lie Ser Val Thr Pro Glu 
165 170 175 

Gly Gly Pro Gly Gly His Ser Leu Gin Val Leu Thr Leu Phe Leu 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 137: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:' 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

TATGGATGAA GAAACTTCTC ATCAGCTGCT GTGTGATAAA TGTCCGCCGG GTAC 54 
(2) INFORMATION FOR SEQ ID NO: 138: 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 380 amino acids 

(B) TYPE: amino acid 
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15 



30 



45 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly His 
15 10 15 

Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr Tyr Leu Lys Gin His 
20 25 30 

Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro Cys Pro Asp His Ser 
35 40 45 

Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys Val Tyr Cys Ser Pro 
20 50 55 60 

Val Cys Lys Glu Leu Gin Ser Val Lys Gin Glu Cys Asn Arg Thr His 
65 70 75 80 

25 Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr Leu Glu lie Glu Phe 

85 90 95 



Cys Leu Lys His Arg Ser Cys Pro Pro Gly Ser Gly Val Val Gin Ala 
100 105 110 

Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Lys Cys Pro Asp Gly Phe 
115 120 125 



Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys He Lys His Thr Asn 
35 130 135 140 

Cys Ser Thr Phe .Gly Leu Leu Leu He Gin Lys Gly Asn Ala Thr His 
145 150 155 160 

40 Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr Gin Lys Cys Gly He 

165 170 175 



Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr 
180 185 190 

Lys He He Pro Asn Trp Leu Ser Val Leu Val Asp Ser Leu Pro Gly 
195 200 205 



Thr Lys Val Asn Ala Glu Ser Val Glu Arg He Lys Arg Arg His Ser 
50 210 215 220 



Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn 
225 230 235 240 
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Arg Asp Gin Glu Met Val Lys Lys He He Gin Asp He Asp Leu Cys 
245 250 255 

Glu Ser Ser Val Gin Arg His Leu Gly His Ser Asn Leu Thr Thr Glu 
260 265 270 

Gin Leu Leu Ala Leu Met Glu Ser Leu Pro Gly Lys Lys He Ser Pro 
275 280 285 

Glu Glu He Glu Arg Thr Arg Lys Thr Cys Lys Ser Ser Glu Gin Leu 
290 295 300 

Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn Gly Asp Gin Asp Thr 
305 310 315 320 

Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu Lys Thr Ser His Phe 
325 330 335 

Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr Met Arg Phe Leu His 
340 345 350 

Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys Leu Phe Leu Glu Met He 
355 360 365 

Gly Asn Gin Val Gin Ser Val Lys He Ser Cys Leu 
370 375 380 

INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His 

15 10 15 

Gin Leu Leu Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gin His 

20 25 30 

Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr 

35 40 45 
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Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro 
50 55 60 

Val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu Cys Asn Arg Thr His 
65 70 75 80 

Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu lie Glu Phe 
85 90 95 

Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gin Ala 
100 105 110 

Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg Cys Pro Asp Gly Phe 
115 120 125 

Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys Arg Lys His Thr Asn 
130 135 140 

Cys Ser Val Phe Gly Leu Leu Leu Thr Gin Lys Gly Asn Ala Thr His 
145 150 155 160 

Asp Asn lie Cys Ser Gly Asn Ser Glu Ser Thr Gin Lys Cys Gly lie 
165 170 175 

Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr 
180 185 190 

Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val Asp Asn Leu Pro Gly 
195 200 205 

Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie Lys Arg Gin His Ser 
210 215 220 

Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn 
225 230 235 240 

Lys Ala Gin Asp lie Val Lys Lys lie lie Gin Asp lie Asp Leu Cys 
245 250 255 

Glu Asn Ser Val Gin Arg His lie Gly His Ala Asn Leu Thr Phe Glu 
260 265 270 

Gin Leu 'Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val Gly Ala 
275 280 285 

Glu Asp lie Glu Lys Thr lie Lys Ala Cys Lys Pro Ser Asp Gin lie 
290 295 300 

Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn Gly Asp Gin Asp Thr 
305 310 315 320 



Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe 
325 . 330 335 
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10 



20 



25 



Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr lie Arg Phe Leu His 
340 345 350 

Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met lie 
355 360 365 

Gly Asn Gin Val Gin Ser Val Lys lie Ser Cys Leu 
370 375 380 

(2) INFORMATION FOR SEQ ID NO: 140: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
TGGACCACCC AGAAGTACCT TCATTATGAC 30 
(2) INFORMATION FOR SEQ ID NO: 141: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) .STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

40 

GTCATAATGA AGGTACTTCT GGGTGGTCCA 30 
(2) INFORMATION FOR SEQ ID NO: 142: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



{ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION* SEQ ID NO: 142: 
GGACCACCCA GCTTCATTAT GACGAAGAAA C 31 
5 (2) INFORMATION FOR SEQ ID NO: 143: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

20 GTTTCTTCGT CATAATGAAG CTGGGTGGTC C 31 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

35 

GTGGACCACC CAGGACGAAG AAACCTCTC 2 9 

(2) INFORMATION FOR SEQ ID NO: 145: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single* 

(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: cDNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
GAGAGGTTTC TTCGTCCTGG GTGGTCCAC 



29 
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(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

15 

CGTTTCCTCC AAAGTTCCTT CATTATGAC 29 
(2) INFORMATION FOR SEQ ID NO: 147: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: cDNA 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GTCATAATGA AGGAACTTTG GAGGAAACG 2 9 

(2) INFORMATION FOR SEQ ID NO: 148: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
GGAAACGTTT CCTGCAAAGT ACCTTCATTA TG 32 

50 

(2) INFORMATION FOR SEQ ID NO: 149: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
CATAATGAAG GTACTTTGCA GGAAACGTTT CC 32 
15 (2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

CACGCAAAAG TCGGGAATAG ATGTCAC 27 
(2) INFORMATION FOR SEQ ID NO: 151: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
3 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
GTGACATCTA TTCCCGACTT TTGCGTG 27 
(2) INFORMATION FOR SEQ ID NO: 152: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



5 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152; 

10 CACCCTGTCG GAAGAGGCCT TCTTC 25 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE; cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

25 GAAGAAGGCC TCTTCCGACA GGGTG 25 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

3 5 (ii) MOLECULE TYPE: cDNA 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

TGACCTCTCG GAAAGCAGCG TGCA 24 
(2) INFORMATION FOR SEQ ID NO: 155: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

5 

TGCACGCTGC TTTCCGAGAG GTCA 24 
(2) INFORMATION FOR SEQ ID NO: 156: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: cDNA 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

CCTCGAAATC GAGCGAGCAG CTCC 24 
(2) INFORMATION FOR SEQ ID NO: 157: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
CGATTTCGAG GTCTTTCTCG TTCTC 25 
(2) INFORMATION FOR SEQ ID NO: 15 8: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



50 
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(xi) SEQUENCE DESCRIPTION:* SEQ ID NO: 158: 
CCGTGAAAAT AAGCTCGTTA TAACTAGGAA TOG 33 
5 (2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 



20 



30 



35 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CCATTCCTAG TTATAACGAG CTTATTTTCA CGG 33 
(2) INFORMATION FOR SEQ ID NO: 160: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 38 
(2) INFORMATION FOR SEQ ID NO: 161: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single* 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
CCTCTCTCGA GTCAGGTGAC ATCTATTCCA CACTTTTGCG TGGC 44 
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{2} INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: cDNA 



15 



20 



30 



35 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 3 8 

(2) INFORMATION FOR SEQ ID NO: 163: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
2 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 163: 

CCTCTCTCGA GTCAAGGAAC AGCAAACCTG AAGAAGGC 3 8 

(2) INFORMATION FOR SEQ ID NO: 164: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 3 8 

(2) INFORMATION FOR SEQ ID NO: 165: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

CCTCTCTCGA GTCACTCTGT GGTGAGGTTC GAGTGGCC 3 8 

15 (2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

2 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

3 0 CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 38 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: cDNA 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

CCTCTCTCGA GTCAGGATGT TTTCAAGTGC TTGAGGGC 
(2) INFORMATION FOR SEQ ID NO: 168: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



10 



Met Lys His His His His His His His Ala Ser Val Asn Ala Leu Glu 
15 10 15 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid encoding a 
polypeptide comprising at least one of the biological 
activities of OPG wherein the nucleic acid is selected 
from the group consisting of: 

a) the nucleic acids shown in Figures 
2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID NO:122), and 9C-9D 
(SEQ ID NO: 124) or complementary strands thereof; 

b) nucleic acids which hybridize under 
stringent conditions with the polypeptide-encoding 
regions as shown in Figures 2B-2C (SEQ ID NO:120), 9A- 
9B (SEQ ID NO:122) and 9C-9D (SEQ ID NO:124); 

c) nucleic acids which hybridize under 
stringent conditions with nucleotides 148 through 337 
inclusive as shown in Figure lA; and 

d) nucleic acid which are degenerate to 
the nucleic acids of (a), (b) and (c) . 

2. The nucleic acid of Claim 1 which is 
cDNA, genomic DNA, synthetic DNA or RNA. 

3. A polypeptide encoded by the nucleic acid 
25 of Claim 1. 

4. The nucleic acid of Claim 1 including one 
or more codons preferred for Escherichia coli 
expression. 

30 

5 . The nucleic acid of Claim 1 having a 
detectable label attached thereto. 

6. The nucleic acid of Claim 1 comprising 

3 5 the polypeptide-encoding region of Figure 2B-2C (SEQ ID 
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NO:120), Figure 9A-9B (SEQ ID NO:122) or Figure 9C-9D 
(SEQ ID NO: 124) . 



7. The nucleic acid of Claim 6 having the 
5 sequence as shown in Figure 9C-D (SEQ ID NO: 124) from 
nucleotides 158-1297. 



8. An expression vector comprising the 
nucleic acid of Claim 1. 

10 

9, The expression vector of Claim 8 wherein 
the nucleic acid comprises the polypeptide - encoding 
region as shown in Figure 9C-9D (SEQ ID NO:124), 

15 10, A host cell transformed or transfected 

with the expression vector of Claim 8. 

11, The host cell of Claim 10 which is a 
eucaryotic cell . 

20 

12, The host cell of Claim 11 which is 
selected from the group consisting of CHO, COS, 293, 
3T3, CV-1 and BHK cells, 

25 13. The host cell of Claim 10 which is a 

procaryotic cell . 



14. The host cell of Claim 13 which is 
Escherichia coli , 

30 

15 . A transgenic mammal comprising the 
expression vector of Claim 8, 



16. The transgenic mammal of Claim 15 which 
35 is a rodent. 
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17. The transgenic mammal of Claim 15 which 
is a mouse, 

18. A process for the production of OPG 
5 comprising: 

growing under suitable nutrient 
conditions host cells transformed or transfected with 
the nucleic acid of Claim 1; and 

isolating the polypeptide products of 
10 the expression of the nucleic acids. 

19. A purifed and isolated polypeptide 
comprising OPG. 

15 20. The polypeptide of Claim 19 which is 

mammalian OPG. 

21. The polypeptide of Claim 20 which is 

human OPG. 

20 

22. The polypeptide of Claim 19 which is 
substantially free of other human proteins. 

23. The polypeptide of Claim 21 having the 
25 amino acid sequence as shown in Figure 2B-2C (SEQ ID 

N0:121), Figure 9A-9B (SEQ ID NO:123), or Figure 9C-9D 
(SEQ ID NO: 125) or a derivative thereof. 

24. The polypeptide of Claim 23 having the 
3 0 amino acid sequence as shown in Figure 9C-9D (SEQ ID 

NO: 125) from residues 22-401 inclusive. 



35 



25. The polypeptide of Claim 23 having the 
amino acid sequence as shown in Figure 9C-9D (SEQ ID 
NO: 125) from residues 32-401 inclusive. 
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26. The polypeptide of Claim 19 which is 
characterized by being a product of expression of an 
exogenous DNA sequence. 

27. The polypeptide of Claim 26 wherein the 
DNA is cDNA, genomic DNA or synthetic DNA. 

28. The polypeptide of Claim 19 which has 
been modified with a water-soluble polymer. 

29. The polypeptide of Claim 28 wherein the 
water soluble polymer is polyethylene glycol , 

30. A polypeptide comprising: 

15 an amino acid sequence of at least about 164 

amino acids comprising four cysteine-rich domains 
characteristic of the cysteine rich domains of tumor 
necrosis factor receptor extracellular regions; and 
an activity of increasing bone density. 

20 

31. A polypeptide comprising the amino acid 
sequence as shown in Figure 2B-2C (SEQ ID NO: 121), 
Figure 9A-9B (SEQ ID NO: 123) or Figure 9C-9D (SEQ ID 
NO: 125) having an amino terminus at residue 22, and 

25 wherein from 1 to 216 amino acids are deleted from the 
carboxy terminus. 

32. The polypeptide of Claim 31 comprising 
the amino acid sequence from residues 22-185, 22-189, 

30 22-194, or 22-201 inclusive. 

33. The polypeptide of Claim 32 further 
comprising an Fc region of human IgGl extending from 
the carboxy terminus , 

35 
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34. A polypeptide comprising the amino acid 
sequence as shown in Figure 2B-2C (SEQ ID N0:121), 
Figure 9A-9B (SEQ ID NO: 123) or Figure 9C-9D (SEQ ID 
NO: 125) having an amino terminus at residue 22, wherein 

5 from 1 to 10 amino acids are deleted from the amino 

terminus and, optionally, from 1 to 215 amino acids are 
deleted from the carboxy terminus. 

35. The polypeptide of Claim 34 comprising' 
10 the amino acid sequence from residues 27-185, 27-189, 

27-194, 27-401, or 32-401 inclusive. 

36. The polypeptide of Claim 3 5 further 
comprising an Fc region of hioman IgGl extending from 

15 the carboxy terminus. 

37. A polypeptide selected from the group 
consisting of: 





huOPG 


[22-201] -Fc 


20 


huOPG 


[22-401] -Fc 




huOPG 


[22-180]-Fc 




huOPG 


met [22-401] -Fc 




huOPG 


Fc-met [22-401] 




huOPG 


met [22-185] 


25 


huOPG 


met [22-189] 




huOPG 


met [22-194] 




huOPG 


met [27-185] 




huOPG 


met [27-189] 




huOPG 


met [27-194] 


30 


huOPG 


met [32-401] 




huOPG 


met-lys [22-401] 




huOPG 


met [22-401] 




huOPG 


met [22-401] -Fc (P25A) 




huOPG 


met [22-401] (P25A) 


35 


huOPG 


met [22-401] .{P26A) 




huOPG 


met ['22-401] (P26D) 
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huOPG met [22-194] {P25A) 

huOPG met [22-194] (P26A) 

huOPG met met-(lys)3 [22-401] 

huOPG met met-arg-gly-ser- (his ) g [22-401] 

38. A nucleic acid encoding the polypeptide 
of Claim 37. 

39. An antibody or fragment thereof which 
specifically binds to OPG. 

40. The antibody of Claim 39 which is a 
monoclonal antibody. 

41. A method for detecting the presence of 
OPG in a biological sample comprising: 

incubating the sample with the antibody ot 
Claim 39 under conditions that allow binding of the 
antibody to OPG; and 

detecting the bound antibody. 
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42. A method to assess the ability of a 
candidate substance to bind to OPG comprising: 

incubating OPG with the candidate substance 
5 under conditions that allow binding; and 
measuring the bound substance. 

43 . A method of regulating the levels of OPG 
in an animal comprising modifying the animal with a 

10 nucleic acid encoding OPG. 

44 . The method of Claim 43 wherein the 
nucleic acid promotes an increase in the tissue level 
of OPG, 

45. The method of Claim 44 wherein the 
animal is a human* 

46. A pharmaceutical composition comprising 
a therapeutically effective amount of OPG in a 
pharmaceutically acceptable carrier, adjuvant, 
solubilizer, stabilizer and/or anti-oxidant . 

47. The composition of Claim 46 wherein the 
25 OPG is human OPG. 

48. The composition of Claim 47 wherein the 
OPG has the amino acid sequence as shown in Figure 9B. 

3 0 49. A method of treating a bone disorder 

comprising administering a therapeutically effective 
amount of the polypeptide of Claim 19. 

50. The method of Claim 49 wherein the 
3 5 polypeptide is hiiman OPG. 
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51. The method of Claim 49 wherein the bone 
disorder is excessive bone loss. 

52. The method of Claim 51 wherein the bone 
5 disorder is selected from the group consisting of 

osteoporosis, Paget 's disease of bone, hypercalcemia, 
hyperparathyroidism, steroid- induced osteopenia, bone 
loss due to rheumatoid arthritis, bone loss due to 
osteomyelitis, osteolytic metastasis, and periodontal 
10 bone loss. 

53 . The method of Claim 49 further 
comprising administering a therapeutically effective 
amount of a substances selected from the group 

15 consisting of bone morphogenic proteins BMP-1 through 
BMP-12, TGF-P family members, IL-1 inhibitors, TNFa 
inhibitors, parathyroid hormone and analogs thereof, 
parathyroid hormone related protein and analogs 
thereof, E series prostaglandins, bisphosphonates, and 

2 0 bone- enhancing minerals. 

54 . An osteoprotegerin multimer consisting 
of osteoprotegerin monomers. 

25 55, The multimer of Claim 54 which is a 

dimer . 

56. The multimer of Claim 54 formed by 
interchain disulfide bonds, 

30 

57. The multimer of Claim 54 formed by 
association Fc regions derived from human IgGl . 

58. The multimer of Claim 54 which is 
35 essentially free of osteoprotegerin monomers and 

inactive multimers. 
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59, The multimer 
monomers comprise the amino 
Figure 9C-9D (SEQ ID NO: 12 5) 
5 derivative thereof. 



of Claim 54 wherein the 
acid sequence as shown in 
from residues 22-401, or a 



60, The multimer of Claim 54 wherein the 
monomers comprise the amino acid sequence as shown in 
Figure 9C-9D (SEQ ID NO: 125) from residues 22-194, 
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ABSTRACT OF THE INVENTION 

The present invention discloses a novel 
secreted polypeptide, termed osteoprotegerin, which is 
5 a member of the ttimor necrosis factor receptor 

superfamily and is involved in the regulation of bone 
metabolism. Also disclosed are nucleic acids encoding 
osteoprotegerin, polypeptides, recombinant vectors and 
host cells for expression, antibodies which bind OPG, 
10 and pharmaceutical compositions. The polypeptides are 
used to treat bone diseases characterized by increased 
resorption such as osteoporosis. 
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FIG.2B 



10 30 50 

ATCAAAGGCAGGGCATACTTCCTGTTGCCCAGACCTTATATAAAACGTCATGTTCGCCTG 

70 90 110 

GGCAGCAGAGAAGCACCTAGCACTGGCCCAGCGGCTGCCGCCTGAGGTTTCCAGAGGACC 

130 150 170 

ACAATGAACAAGTGGCTGTGCTGTGCACTCCTGGTGTTCTTGGACATCATTGAATGGACA 
M E K JS L Q CALLVFLDIIEWT 

190 210 230 

ACCCAGGAAACCTTTCCTCCAAAATACTTGCATTATGACCCAGAAACCGGACGTCAGCTC 
TO E TFPPKYLHYDPETGRQL 

250 270 290 

TTGTGTGACAAATGTGCTCCTGGCACCTACCTAAAACAGCACTGCACAGTCAGGAGGAAG 
LCDKCAPGTYLKQHCTVRRK 

310 330 350 

ACACTGTGTGTCCCTTGCCCTGACTACTCTTATACAGACAGCTGGCACACGAGTGATGAA 
TLCVPCPDYSYTDSWHTSDE 

370 390 410 

TGCGTGTACTGCAGCCCCGTGTGCAAGGAACTGCAGACCGTGAAACAGGAGTGCAACCGC 

CVYCSPVCKELQTVKQECMR 
430 450 470 

ACCCACAACCGAGTGTGCGAATGTGAGGAAGGGCGCTACCTGGAGCTCGAATTCTGCTTG 
THNRVCECEEGRYLELEFCL 

490 510 530 

AAGCACCGGAGCTGTCCCCCAGGCTTGGGTGTGCTGCAGGCTGGGACCCCAGAGCGAAAC 
KHRSCPPGLGVLQAGTPERN 

550 570 590 

ACGGTTTGCAAAAGATGTCCGGATGGGTTCTTCTCAGGTGAGACGTCATCGAAAGCACCC 
TVCKRCPDGFFSGETSSKAP 

610 630 650 

TGTAGGAAACACACCAACTGCAGCTCACTTGGCCTCCTGCTAATTCAGAAAGGAAATGCA 
CRKHTMCSS LGLLLIQKGHA 

670 690 710 

ACACATGACAATGTATGTTCCGGAAACAGAGAAGCAACTCAAAATTGTGGAATAGATGTC 
THDNVCSGNREATQNCGIDV 

730 750 770 

ACCCTGTGCGAAGAGGCATTCTTCAGGTTTGCTGTGCCTACCAAGATTATACCGAATTGG 
TLCEEAFFRFAVPTKIIPNW 

790 810 830 

CTGAGTGTTCTGGTGGACAGTTTGCCTGGGACCAAAGTGAATGCAGAGAGTGTAGAGAGG 
LSVLVDSLPGTKVNAESVER 

850 870 890 

ATAAAACGGAGACACAGCTCGCAAGAGCAAACTTTCCAGCTACTTAAGCTGTGGAAGCAT 
IKRRHSSQEQTFQLLKLWKH 

910 930 950 

CAAAACAGAGACCAGGAAATGGTGAAGAAGATCATCCAAGACATTGACCTCTGTGAAAGC 
QNRDQEMVKKIIQDIDLCES 

970 990 1010 

AGTGTGCAACGGCATATCGGCCACGCGAACCTCACCACAGAGCAGCTCCGCATCTTGATG 
SVQRHIGHAHI'TTEQLRILM 



FIG.2C 



1030 1050 1070 

GAGAGCTTGCCTGGGAAGAAGATCAGCCCAGACGAGATTGAGAGAACGAGAAAGACCTGC 
ESLPGKKISPDEIERTRKTC 

1090 1110 1130 

AAACCCAGCGAGCAGCTCCTGAAGCTACTGAGCTTGTGGAGGATCAAAAATGGAGACCAA 
KPSEQLLKLLSLWRIKNGDQ 
1150 1170 1190 

GACACCTTGAAGGGCCTGATGTACGCACTCAAGCACTTGAAAGCATACCACTTTCCCAAA 
DTLKGLMYALKHLKAYHFPK 
1210 1230 1250 

ACCGTCACCCACAGTCTGAGGAAGACCATCAGGTTCTTGCACAGCTTCACCATGTACCGA 
TVTHSLRKTIRFLHSFTMYR 
1270 1290 1310 

TTGTATCAGAAACTCTTTCTAGAAATGATAGGGAATCAGGTTCAATCAGTGAAGATAAGC 
LYQKLFLEMIGNQVQSVKIS 
1330 1350 1370 

TGCTTATAGTTAGGAATGGTCACTGGGCTGTTTCTTCAGGATGGGCCAACACTGATGGAG 
C L 

1390 1410 1430 

CAGATGGCTGCTTCTCCGGCTCTTGAAATGGCAGTTGATTCCTTTCTCATCAGTTGGTGG 
1450 1470 1490 

GAATGAAGATCCTCCAGCCCAACACACACACTGGGGAGTCTGAGTCAGGAGAGTGAGGCA 
1510 1530 1550 

GGCTATTTGATAATTGTGCAAAGCTGCCAGGTGTACACCTAGAAAGTCAAGCACCCTGAG 
1570 1590 1610 

AAAGAGGATATTTTTATAACCTCAAACATAGGCCCTTTCCTTCCTCTCCTTATGGATGAG 
1630 1650 1670 

TACTCAGAAGGCTTCTACTATCTTCTGTGTCATCCCTAGATGAAGGCCTCTTTTATTTAT 
1690 1710 1730 

TTTTTTATTCTTTTTTTCGGAGCTGGGGACCGAACCCAGGGCCTTGCGCTTGCGAGGCAA 
1750 1770 1790 

GTGCTCTACCACTGAGCTAAATCTCCAACCCCTGAAGGCCTCTTTCTTTCTGCCTCTGAT 
1810 1830 1850 

AGTCTATGACATTCTTTTTTCTACAATTCGTATCAGGTGCACGAGCCTTATCCCATTTGT 
1870 1890 1910 

AGGTTTCTAGGCAAGTTGACCGTTAGCTATTTTTCCCTCTGAAGATTTGATTCGAGTTGC 
1930 1950 1970 

AGACTTGGCTAGACAAGCAGGGGTAGGTTATGGTAGTTTATTTAACAGACTGCCACCAGG 
1990 2010 2030 

AGTCCAGTGTTTCTTGTTCCTCTGTAGTTGTACCTAAGCTGACTCCAAGTACATTTAGTA 
2050 2070 2090 

TGAAAAATAATCAACAAATTTTATTCCTTCTATCAACATTGGCTAGCTTTGTTTCAGGGC 
2110 2130 2150 

ACTAAAAGAAACTACTATATGGAGAAAGAATTGATATTGCCCCCAACGTTCAACAACCCA 
2170 2190 2210 

ATAGTTTATCCAGCTGTCATGCCTGGTTCAGTGTCTACTGACTATGCGCCCTCTTATTAC 
2230 2250 2270 

TGCATGCAGTAATTCAACTGGAAATAGTAATAATAATAATAGAAATAAAATCTAGACTCC 
2290 2310 2330 

ATTGGATCTCTCTGAATATGGGAATATCTAACTTAAGAAGCTTTGAGATTTCAGTTGTGT 
2350 2370 2390 

TAAAGGCTTTTATTAAAAAGCTGATGCTCTTCTGTAAAAGTTACTAATATATCTGTAAGA 
2410 2430 

CTATTACAGTATTGCTATTTATATCCATCCAG 
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FIG.9A 



10 30 50 

CCTTATATAARACGTCATGATTGCCTGGGCTGCAGAGACGCACCTAGCACTGACCCAGCG 
70 90 110 

GCTGCCTCCTGAGGTTTCCCGAGGACCACAATGAACAAGTGGCTGTGCTGCGCACTCCTG 

n H K W L r C A T. T. 

130 150 170 

GTGCTCCTGGACATCATTGAATGGACAACCCAGGAAACCCTTCCTCCAAAGTACTTGCAT 
y — L — L Q I I E w 'I' T n f: T L P p k Y L H 

190 210 230 

TATGACCCAGAAACTGGTCATCAGCTCCTGTGTCACAAATGTGCTCCTGGCACCTACCTA 
YDPETGHQLLCDKCAPGTYL. 

250 270 290 

AAACAGCACTGCACAGTGAGGAGGAAGACATTGTGTGTCCCTTGCCCTGACCACTCTTAT 
KQHCTVRRKTLCVPCPDHSY 

310 330 350 

ACGGACAGCTGGCACACCAGTGATGAGTGTGTGTATTGCAGCCCAGTGTGCAAGGAACTG 
TDSWHTSDECVYCSPVCKEL 

370 390 410 

CAGTCCGTGAAGCAGGAGTGCAACCGCACCCACAACCGAGTGTGTGAGTGTGAGGAAGGG 
QSVKQECHRTHNRVCECEEG 

430 450 470 

CGTTACCTGGAGATCGAATTCTGCTTGAAGCACCGGAGCTGTCCCCCGGGCTCCGGCGTG 
RYLEIEFCLKHRSCPPGSGV 

490 510 530 

GTGCAAGCTGGAACCCCAGAGCGAAACACAGTTTGCAAAAAATGTCCAGATGGGTTCTTC 
VQAGTPERNT,VCKKCPDGFF 

550 570 590 

TCAGGTGAGACTTCATCGAAAGCACCCTGTATAAAACACACGAACTGCAGCACATTTGGC 
SGETSSKAPCIKHTJICSTFG 

610 630 650 

CTCCTGCTAATTCAGAAAGGAAATGCAACACATGACAACGTGTGTTCCGGAAACAGAGAA 
LLLIQKGHATHDNVCSGNRE 

670 690 710 

GCCACGCAAAAGTGTGGAATAGATGTCACCCTGTGTGAAGAGGCCTTCTTCAGGTTTGCT 
ATQKCGIDVTLCEEAFFRFA 

730 750 770 

GTTCCTACCAAGATTATACCAAATTGGCTGAGTGTTTTGGTGGACAGTTTGCCTGGGACC 
VPTKI IPNWLSVLVDSLPGT 



FIG.9B 



790 810 830 

AAAGTGAATGCCGAGAGTGTAGAGAGGATAAAACGGAGACACAGCTCACAAGAGCAAACC 
KVNAESVERIKRRHSSQEQT 

850 870 890 

TTCCAGCTGCTGAAGCTGTGGAAACATCAAAACAGAGACCAGGAAATGGTGAAGAAGATC 
FQLLKLWKHQNRDQEMVKKI 
910 930 950 

ATCCAAGACATTGACCTCTGTGAAAGCAGCGTGCAGCGGCATCTCGGCCACTCGAACCTC 
IQDIDLCESSVQRHLGHSML 
970 990 1010 

ACCACAGAGCAGCTTCTTGCCTTGATGGAGAGCCTGCCTGGGAAGAAGATCAGCCCAGAA 
TTEQLLALMESLPGKKISPE 
1030 1050 1070 

GAGATTGAGAGAACGAGAAAGACCTGCAAATCGAGCGAGCAGCTCCTGAAGCTACTCAGT 
EIERTRKTCKSSEQLLKLLS 
1090 1110 H30 

TTATGGAGGATCAAAAATGGTGACCAAGACACCTTGAAGGGCCTGATGTATGCCCTCAAG 
I. WRIKNGDQDTLKGLMYALK 
1150 1170 1190 

CACTTGAAAACATCCCACTTTCCCAAAACTCTCACCCACAGTCTGAGGAAGACCATGAGG 
HLKTSHPPKTVTHSLRKTMR 
1210 1230 1250 

TTCCTGCACAGCTTCACAATGTACAGACTGTATCAGAAGCTCTTTTTAGAAATGATAGGG 
FLHSFTMYRLYQKLFLEMIG 
1270 1290 1310 

AATCAGGTTCAATCCGTGAAAATAAGCTOCTTATAACTAGGAATGGTCACTGGGCTGTTT 
NQVQSVKISCL 



CTTCA 



FIG.9C 



10 30 50 

GTATATATAACGTGATGAGCGTACGGGTGCGGAGACGCACCGGAGCGCTCGCCCAGCCGC 

70 90 110 

CGYCTCCAAGCCCCTGAGGTTTCCGGGGACCACAATGAACAAGTTGCTGTGCTGCGCGCT 

M U K L L C C A u 

130 150 170 

CGTGTTTCTGGACATCTCCATTAAGTGGACCACCCAGGAAACGTITCCTCCAAAGTACCT 
VPLD ISIKWTTQE T F P P K Y L 
190 210 230 

TCATTATGACGAAGAAACCTCTCATCAGCTGTTGTGTGACAAATGTCCTCCTGGTACCTA 
HYDEETSHQLLCDKCPPGTY 
250 270 290 

CCTAAAACAACACTGTACAGCAAAGTGGAAGACCGTGTGCGCCCCTTGCCCTGACCACTA 
LKQHCTAKWKTVCAPCPDHY 
310 330 350 

CTACACAGACAGCTGGCACACCAGTGACGAGTGTCTATACTGCAGCCCCGTGTGCAAGdA 
YTDSWHTSDECLYCSPVCKE 
370 390 410 

GCTGCAGTACGTCAAGCAGGAGTGCAATCGCACCCACAACCGCGTGTGCGAATGCAAGGA 
LQYVKQBCHRTHNRVCECKE 
430 450 470 

AGGGCGCTACCTTGAGATAGAGTTCTGCTTGAAACATAGGAGCTGCCCTCCTGGATTTGG 
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AGGTAACCAGGTCCAATCAGTAAAAATAAGCTGCTTATAACTGGAAATGGCCATTGAGCT 
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